A NEW MEASURE OF THE DEVELOPMENT OF THE 
CARPAL BONES AND ITS RELATION TO 
PHYSICAL AND MENTAL 
DEVELOPMENT 


FRANK N. FREEMAN 
University of Chicago 
AND 
THOMAS M. CARTER 
Albion College 


THE PROBLEM 


The investigation which is reported in this article is a study of 
physiological development and its relation to mental development.¹ We 
may introduce the report of the study by a comment on its relation 
to the previous investigation of this problem. 

The study of physiological development has gone through two 
general stages. The first stage consisted of an investigation of the 
general course of development by plotting curves of age growth based 
upon the medians of successive ages. This kind of study is represented 
in the familiar growth curves in weight, height, vital capacity, arm 
span, and the like. 

This type of measure of development meets the difficulty that one 
cannot readily interpret the stage of development of individuals by 
comparison with such a curve. If the height of an individual of a 
certain age is above the median height for that age we are unable to 
determine to what extent this superiority is due to precocity in develop- 
ment on the one hand, or to the fact that the individual will be taller 
than the average at maturity on the other hand. In other words, individuals 
differ in final size and in rate of development, and the mere 
comparison of individual measures with median growth curves does 
not enable us to distinguish these two factors. 

1 Summary of part of a study reported by Thomas M. Carter in a doctor's 
dissertation which is on file in the library of the University of Chicago. 


Another way of putting the difficulty is to say that the composite 
growth curve which is made up from the medians includes within it a 
variety of types of growth curve and is to be regarded not as a repre- 
sentative of typical growth, but as a mere mathematical average of a 
number of types. As such, it is of little value as a means of individual 
diagnosis. 

Because of the defects of the general growth curve which have been 
mentioned, it cannot readily be used as a means of determining the 
correlation between individual differences in the time or rate of 
physical maturing and individual differences in mental maturing or in 
intelligence. Some means must be found to distinguish between the 
two aspects of growth, namely, final size and stage of maturity. 

It may be remarked, parenthetically, that we meet a similar diffi- 
culty in our interpretation of measures of intelligence. It is custom- 
ary to identify mental maturity with superiority in intellectual 
capacity. We have no means of distinguishing between the level of 
capacity which the individual will ultimately reach and the rate of 
maturing of intellectual capacity. This is a problem on which light 
may possibly be thrown by repeating measurements of children up to 
maturity. 

To return to the discussion of physical measures, we come to the 
second stage in investigation. In this stage the attempt is made to 
determine individual rates of maturing independently of final size. 
The first method which was attempted was the study of individual 
growth curves and the identification of periods of acceleration which 
represent definite stages of maturing. Baldwin’s survey of individual 
growth curves was the first wide-scale attack upon this problem. This 
identification of stages of development requires possession of a series 
of repeated measurements on the same individual. There is some cor- 
relation between final size and the periods at which stages of 
development occur, and final size can be predicted fairly closely from 
the size at birth; but this correlation between size and rate of develop- 
ment is not close enough to be used as the basis of prediction. 

It is therefore desirable to find other measures of physical maturity. 
This has led to the attempt to find in the ossification of the bones a 
maturity measure. The study by Rotch, Bean, Woodrow, and Baldwin 
of the ossification of the carpal bones was for the purpose of discovering 
in it the required measure. This study was at first pursued 
by the inspection method. The stages of development are distin- 
guished by comparing the radiographs of the carpal bones of individuals 
with radiographs which are taken to represent the medians for the 
successive ages. The distinguishing marks of maturity are found in 
the appearance and the size of the bones. There is no means of 
determining which elements are most important as indices of maturity 
by this method, nor is it possible because of the lack of quantitative 
measurement to obtain a verification of the method. 

An advance over the inspection method was made by Baldwin when 
he measured by means of a planimeter the area of projection of the 
carpal bones in the radiograph. This method meets one of the diffi- 
culties in that it makes the measure quantitative. It has another 
difficulty, however, in probably greater degree than the inspection 
method, on account of the fact that, like the earlier measures of height, 
weight, etc., it does not distinguish between size and rate of maturing. 
That this is a serious difficulty will appear from the data of the present 
study which will be reported in a later paragraph. 


METHOD OF PRESENT STUDY 


This study is concerned, first, with a method of measurement by 
means of which the difficulties which have just been described might 
be overcome and with its verification by a comparison of the growth 
curves of boys and girls. It is concerned, in the second place, with the 
use of this method for the study of certain aspects of physical develop- 
ment and for the study of the relation between physical and mental 
development. 

We are concerned first, then, with the method of measurement and 
with the examination of its validity. It was soon discovered from the 
study of the measurements of the areas of the projection of the carpal 
bones that this area was the product of two factors and that unless 
these factors were distinguished, this measure could not be interpreted. 
In order to distinguish these factors, it is necessary to secure an independent 
measure of general skeletal size. This was done by finding 
the area of a quadrilateral which was made by joining specified points 
on the bones at the extremities of the bones of the arm and of the hand. 
These parts will be specified more particularly in a moment. This 
area, as will be shown in a later article, was found to represent with 
considerable accuracy the general size of the individual. The assump- 
tion which was made as a starting point was that the size of the carpal 
bones would be influenced by the general size of the skeleton and that 
this could be discounted by the use of the area of the above mentioned 
quadrilateral as a term in a ratio. Specifically, the area of the carpal 
quadrilateral was used as the denominator of a fraction. The stage of 
development of the carpal bones was then represented by finding the 
ratio of the sum of the areas of the carpal bones to this carpal quadrilateral. 
To sum up then, we measured first the area of a quadrilateral 
made by drawing lines from fixed points in the wrist and called this 
area the carpal quadrilateral. We then found the sum of the areas of 
the projections of the carpal bones and called this the total ossification. 
We then found the ratio of the total ossification to the carpal quadrilateral 
and called this the ossification ratio. 
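
The three measures just defined can be illustrated with a minimal sketch in Python. It is only an illustration under stated assumptions: the corner coordinates, the bone areas, and the function names are hypothetical, and the quadrilateral is computed here from coordinates by the shoelace formula, whereas the bone areas in the study were traced on the radiographs with a planimeter.

```python
def quadrilateral_area(corners):
    """Area of a simple polygon from its vertices taken in order (shoelace formula)."""
    total = 0.0
    n = len(corners)
    for i in range(n):
        x1, y1 = corners[i]
        x2, y2 = corners[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def ossification_ratio(bone_areas, quad_area):
    """Total ossification (sum of the bone areas) divided by the carpal quadrilateral."""
    total_ossification = sum(bone_areas)
    return total_ossification / quad_area


# Hypothetical measurements, not data from the study.
corners = [(0.0, 0.0), (20.0, 0.0), (18.0, 15.0), (2.0, 15.0)]  # four fixed points
bone_areas = [30.0, 25.0, 20.0, 15.0, 10.0]  # projected area of each carpal bone

quad = quadrilateral_area(corners)
ratio = ossification_ratio(bone_areas, quad)
print(f"carpal quadrilateral = {quad:.1f}, ossification ratio = {ratio:.3f}")
```

Because the quadrilateral appears only in the denominator, two hands with the same total ossification but different skeletal size receive different ratios, which is the separation of size from maturity that the method is meant to secure.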


TECHNIQUE OF MEASUREMENT 


It is readily recognized that there are three dimensions of the bones 
in question. For a complete knowledge of the amount of ossification 
present in any given unit the investigator should know all three of 
these dimensions. If only a single radiograph is used, however, one is 
limited to two of these dimensions, namely length and breadth, 
since thickness cannot be skiagraphically determined. If the bones 
were regular in shape, the two dimensional diameters when considered 
together would afford as accurate a method of determining size of 
skiagraph as could be obtained. But the bones were found to be quite 
irregular. The diameters, therefore, might differ greatly while the 
areas of the two bones would be the same. We rejected, accordingly, 
the diameter method of determining the size of the bones. 

It seemed evident that what was needed was some method which 
would measure the entire exposed area of the ossified surface. A 
planimeter was then secured and when its accuracy had been thor- 
oughly tested experiments were made to determine its adaptability 
to our needs. 

In order to test the accuracy with which this method of measuring 
could be used in our investigation, one of the authors selected 50 
skiagraphs and measured them twice and calculated the coefficient of 
correlation between the two series of measures. That the manipulator 
might not be influenced in the second measurement by the results 
obtained in the first measurement, the entire series of bones was 
measured for the first time and then for the second time. To guard 
further against bias which might result from knowing what measurements 
were obtained, the planimeter readings were dictated to another person 
and the exact measures deduced from the readings were not known by 
the manipulator until the two series of planimeter tracings had been 
completed. The coefficient of correlation obtained was .987 ± .009. 
The results obtained were as accurate as could be expected and the 
method was adopted as satisfactory for determining the size of the 
skiagraphs of the bones. 
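
The reliability check described above amounts to correlating two series of measurements of the same films. A minimal sketch follows, assuming the ordinary product-moment coefficient; the five pairs of areas are hypothetical and stand in for the 50 skiagraphs, whose individual readings are not reported.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical planimeter areas for the same five films, traced twice.
first_series = [41.2, 55.0, 63.7, 72.1, 88.4]
second_series = [40.8, 55.9, 63.1, 73.0, 87.9]

r = pearson_r(first_series, second_series)
print(f"test-retest r = {r:.3f}")
```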

The skiagraphic area of the bones was measured from the films 
rather than from the prints. For illuminating the films from beneath, 
a box, 11 inches wide, 10 inches deep and 22 inches long over all, was 
constructed and provided with electric fixtures for two lamps. The 
box was enameled white inside and the top was made of frosted glass. 
Such an arrangement provided ample illumination, but the heat pro- 
duced by the two lamps was greater than the films could stand. 
Accordingly a piece of plate glass the size of the top of the box and 
raised from the frosted glass top by supports at each end about an 
inch high was added to the apparatus. A sufficient air chamber was 
thus provided between the box and the plate glass so that the heat 
was no longer a disturbing element. Not only did this device solve the 
problem of heat, but it also provided an easy arrangement for anchor- 
ing the films and holding them steadfast while the measurements were 
being made. The films were fastened to the piece of glass by strong 
spring clamps. 


THE VALIDITY OF THE OSSIFICATION RATIO AS A 
MEASURE OF MATURITY 


The validity of this measure is indicated by a study of the sex 
differences in the three measures, namely, the carpal quadrilateral, 
the total ossification, and the ossification ratio. The comparison 
between the age curves for boys and girls in these three measures fits 
exactly our hypotheses and what we should expect in the light of our 
knowledge of sex differences in growth and our assumptions concerning 
these three measures. The facts are shown in the three charts. 

Fig. 1 shows the growth curves of boys and girls separately in the 
carpal quadrilateral, which is taken to be a measure of skeletal size. 
The characteristic growth phenomena are here well represented. The 
boys’ measures are larger than those of girls, as we find in the case of 
height and weight. The acceleration in the case of girls comes earlier than 
in the case of boys. The number of cases for each age (20 for each sex) 
is not large enough to give a perfectly smooth curve, but it will be seen 
that the curve for girls approaches that for boys in the pre-adolescent 
period but that the boys’ curve diverges from that of girls beginning 
with 11 years. At this point the rate of growth for girls begins to 
diminish. 

FIGURE 1. 
CARPAL QUADRILATERAL CURVES FOR BOYS AND GIRLS 5-17 YEARS OF AGE 

Contrast these curves with the curves for ossification ratio of boys 
and girls shown in Fig. 2. Here the curve for girls is above that for 
boys at every age, as we should expect. Furthermore, the pubertal 
acceleration comes a year earlier for girls than for boys, namely at 10 
for girls and 11 for boys. The development of boys, however, continues 
longer than that for girls and the appearance of the curves suggests 
that they would come together at some later age, probably 19 
or 20. The girls show a rate of development which is from one to two 
years in advance of that of the boys. All this agrees exactly with our 
general evidence concerning the relation between the rate of maturing 
of the two sexes. 

FIGURE 2. 
OSSIFICATION RATIO CURVES FOR BOYS AND GIRLS 5-17 YEARS OF AGE 

Contrast the curves of the ossification ratio with the curves of 
total ossification shown in Fig. 3. The curves for boys and girls cross 
and re-cross until age 14. The irregularities in the curve are probably 
due in part to the limited number of cases, but the general fact up to 
adolescence is that, so far as this measure is an indication, the development 
of boys and girls is practically alike. This we know to be contrary 
to the facts and therefore total ossification is not a suitable 
measure so far as the comparison between boys and girls is concerned. 

FIGURE 3. 
TOTAL OSSIFICATION CURVES FOR BOYS AND GIRLS 5-17 YEARS OF AGE 










If the ossification ratio is valid as a comparative measure of the 
development of boys and girls by virtue of the fact that it enables us 
to discount differences in size and to obtain a measure of development 
unaffected by size, we may infer that it will also serve as a satisfactory 
means of measuring the relative maturity of individuals of the same sex 
regardless of their differences in size. 

FIGURE 4. 
OSSIFICATION RATIO CURVES FOR TALL AND SHORT BOYS 7-17 YEARS OF AGE 













266 The Journal of Educational Psychology 


PRELIMINARY RESULTS FROM THE USE OF THE OSSIFICATION RATIO 
AS A MEASURE OF MATURITY 


Some of the results of the use of this measure with respect to physiological 
development have already been mentioned in commenting on 
the curves of the ossification ratio. The stages of development appear 
to be comparable to those which we find when we measure the acceleration 
in height. Apparently the acceleration in development of the 
carpal bones runs parallel to the acceleration in general physical 
growth. The most interesting feature of these curves as measures of 
development is that they give an appearance of ultimately coming 
together. If the ossification ratio is a true measure of development we 
should expect this union of the curves to occur. It is further interesting 
to note that the point at which development ceases and at which the 
curves will unite is apparently several years beyond age 17. 

FIGURE 5. 
OSSIFICATION RATIO CURVES FOR TALL AND SHORT GIRLS 7-17 YEARS OF AGE 

The measurements were classified in order to bring out a com- 
parison between tall and short individuals of the same sex. It will be 
remembered that Baldwin has given evidence to show that tall indi- 
viduals, in general, develop earlier than short individuals. In order 
to get evidence upon this point the age curves for each sex were classi- 
fied by height and the ossification ratio curves for the tall and the short 
individuals were plotted separately. As is apparent from Figs. 4 and 5, 
the tall individuals are somewhat earlier in development than the short 
individuals, although the difference is not great. The difference in the 
case of boys is not apparent after year 11. It is more uniform in the 
case of girls and appears up to the age of 14. In the case of the girls 
the average difference between the tall and short individuals is in the 
neighborhood of a half year’s development. 

The last comparison to be mentioned is between physical and 
mental development. The data on this point are provisional and are 
subject to further analysis from a study of individual growth curves on 
the side of both physical and mental development. The data which 
we have to present at the present time are in the form of correlations. 
The following table forms the basis of discussion. 


CORRELATIONS BETWEEN VARIOUS MEASURES OF PHYSICAL AND MENTAL 
DEVELOPMENT 

                                                        BOYS    GIRLS 
Mental age and ossification ratio.....................  .73     .75 
Mental age and chronological age......................  .82     .83 
Chronological age and ossification ratio..............  .85     .87 
Grade location and anatomical ratio...................  .86     .89 
Grade location and chronological age..................  .93     .94 
Partial correlation with age constant: 
  Mental age and ossification ratio...................  .084    .088 


One or two comparisons between these correlations may be pointed 
out. Chronological age seems to have a closer correlation with mental 
age than does the ossification ratio. This would seem at first sight to 
contradict the opinion that mental development corresponds more 
closely to physiological maturity than it does to age. At any rate, it 
seems to make it improbable that we may find a measure of physiological 
maturity which will give a better base upon which to calculate 
intelligence than is furnished by chronological age. It may possibly 
be that the closer correlation between chronological age and mental 
age is due to the fact that mental age has not been analyzed into the 
two factors of degree of maturity and final intelligence, though it is not 
entirely clear why this should favor the correlation with chronological 
age rather than with anatomical ratio. This is a question which 
demands further investigation. 

The closer relation between grade location and chronological age 
than between grade location and anatomical ratio may be due to the 
fact that chronological age is used as a basis for entrance to school and 
for promotion. In both cases the correlation is high. 

When we partial out age and find the correlation between mental 
age and ossification ratio with chronological age constant, we get 
practically a zero correlation. This coefficient agrees closely with that 
found by Baldwin and Stecher (p. 57), which was .09.¹ This means 
that those individuals who are precocious in physiological development 
are not the individuals who are high in intelligence in comparison with 
others of the same age.² 
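
The partialling out of age can be sketched with the usual first-order partial correlation formula. This is a sketch under assumptions: the article does not say how its partial coefficients were computed, and the function name is hypothetical. The inputs are the rounded zero-order coefficients from the table, so the recomputed values (roughly .11 for boys and .10 for girls) need not agree exactly with the printed .084 and .088.

```python
import math


def partial_r(r_xy, r_xz, r_yz):
    """Correlation of x and y with z held constant (first-order partial)."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# x = mental age, y = ossification ratio, z = chronological age.
# Zero-order coefficients as printed in the table (two decimal places).
boys = partial_r(r_xy=0.73, r_xz=0.82, r_yz=0.85)
girls = partial_r(r_xy=0.75, r_xz=0.83, r_yz=0.87)

print(f"boys: {boys:.3f}  girls: {girls:.3f}")
```

Either way the coefficient is small: most of the .73 and .75 between mental age and ossification ratio is accounted for by the dependence of both on chronological age.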

Again we have the difficulty in interpretation due to the fact that 
the measure of mental age is a composite. Ossification ratio is a 
measure of the distance which the individual has traveled along the 
road to maturity, or precocity. Mental age, on the other hand, is a 
measure both of the distance the individual has traveled and of the 
level of the road along which he is traveling—of precocity and bright- 
ness. It is a composite measure of the two and we do not know in 
any case how much it may be affected by each of these two factors. 
In such a case our interpretation of the correlation between ossification 
ratio or anatomical maturity and mental age is somewhat involved, 
but the following inference seems to be sound. 


1 Baldwin, B. T. and Stecher, Lorle I.: “Mental Growth Curve of Normal 
and Superior Children.” Iowa City, University of Iowa Studies, Vol. II, No. 1, 
1922. 

2 Since the above was written, the writers have received the monographs of 
Prescott in which very low partial correlations between anatomical index and 
mental age with chronological age constant are reported. For four groups the 
coefficients are, respectively, -.05, .12, .30, .33. See Prescott, D. A.: “The 
Determination of Anatomical Age in School Children and Its Relation to Mental 
Development.” Cambridge, Harvard Monographs in Education, 1923, Series 
1, No. 5. 
Suppose that anatomical precocity is correlated with mental precocity. 
Anatomical precocity should then be correlated with mental 
age. We have seen that mental age is made up of the two factors of 
brightness and precocity. Anatomical precocity may or may not be 
positively correlated with brightness. It is almost certainly not nega- 
tively correlated with it. If, then, anatomical precocity is correlated 
with one of the two factors in mental age—viz. mental precocity, and 
if its correlation with the other is at least not lower than zero, it should 
be positively correlated with the composite of the two. Since anatomi- 
cal precocity is not correlated with the composite (mental age) it 
cannot be correlated with mental precocity, which is one of the two 
factors in this composite. 

The same argument applies to the other factor of mental age— 
brightness. Anatomical precocity appears not to be correlated with 
it either. 
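
The step from the components to the composite can be made explicit with a small sketch. It rests on an assumption the article does not state, namely that mental age is treated as the simple sum of a precocity component and a brightness component; the function name and the illustrative coefficients are hypothetical.

```python
import math


def corr_with_composite(r_xp, r_xb, r_pb, sd_p=1.0, sd_b=1.0):
    """Correlation of x with the sum p + b, given the three pairwise correlations.

    x = anatomical precocity, p = mental precocity, b = brightness.
    """
    sd_sum = math.sqrt(sd_p ** 2 + sd_b ** 2 + 2 * r_pb * sd_p * sd_b)
    return (r_xp * sd_p + r_xb * sd_b) / sd_sum


# If x correlated .50 with precocity and zero with brightness, it would
# still correlate clearly with the composite.
with_link = corr_with_composite(r_xp=0.5, r_xb=0.0, r_pb=0.0)

# With no correlation to either component, none remains with the composite.
without_link = corr_with_composite(r_xp=0.0, r_xb=0.0, r_pb=0.0)

print(f"{with_link:.3f} {without_link:.3f}")
```

The numerator is a sum of two terms, so if neither is negative a near-zero correlation with the composite requires both to be near zero, which is the inference drawn in the text.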

These conclusions seem at first sight to be contrary to certain 
fairly well established facts. A number of studies, culminating in the 
recent extensive survey by Terman, indicate that bright and scholasti- 
cally advanced children are taller and heavier than are dull and 
retarded children. Upon analysis, however, it will be seen that this fact 
is not necessarily contradictory to our conclusions. Size and weight 
themselves are the product of two factors, final size and rate of growth. 
Only rate of growth corresponds to anatomical precocity. The facts 
do not prove that rate of physical growth is correlated with the rate of 
mental growth. We have been inclined to interpret our facts as indi- 
cating such a correspondence, but the lack of correlation between ossi- 
fication ratio and mental age at least casts grave doubt upon this 
interpretation. 

A word as to the practical bearing of these theoretical conclusions. 
Our data seem to indicate that individual variations in the rate of 
skeletal development are independent of individual differences in the 
rate of mental development. This fact bears, first, upon our method 
of reckoning intelligence or brightness. Our method has been to 
reckon intelligence with reference to chronological age as a base—as by 
finding the ratio of mental age to chronological age. It has been pro- 
posed that we substitute some measure of physical development—as 
anatomical age—for chronological age, in this calculation. It seems 
quite clear from our data that we should not be justified in making such 
a substitution. There is an error in our present calculations, but there 
would be a larger error in the proposed calculation, because mental 
age is more closely correlated with chronological age than it is with 
ossification ratio. 

Again, it has been suggested that children be classified in school on 
the basis of their physical development, as measured by such an index 
as ossification ratio. Two issues are involved, the adjustment of 
school organization to social development, on the one hand, and to 
intellectual development, on the other hand. Promotion according to 
anatomical development is proposed on the ground of its relationship 
both to social and to intellectual development. We are here concerned 
only with the latter. Our data show that the child’s capacity to do 
advanced intellectual work cannot be measured by his rate of anatomi- 
cal development. The child’s variation in mental development must 
be measured by mental tests, and it cannot be inferred from the results 
of physical tests. Classification should be based primarily on measures 
of intellectual growth and capacity, and only secondarily, if at all, 
upon measures of physical growth and capacity. 


JOHNSON’S MEASUREMENT OF RATE OF 
IMPROVEMENT UNDER PRACTICE 


JOSEPH PETERSON 
George Peabody College 


An important though not uncommon error lurks in the data and 
arguments which Johnson presents, in his very interesting papers.¹ His 
view 

DAYS OF PRACTICE 
FIG. 1. A and A’ show improvement in learning of the 30 subjects who had scored 
above the group average in the intelligence tests and of the 30 who had scored below 
average, respectively. Units give the average number of words per day’s practice of 10 
minutes, in terms of hundreds. Curves t and t’ show for these groups in the same order 
the time required to read 1000 words, units being in 5-minute periods. 

1 Johnson, O. J.: A Study of the Relation between Ability to Learn and Intelligence 
as Measured by Tests. Journal Educational Psychology, Vol. XIV, 1923, 
pp. 540-544. 

that may or may not be true, however, according to how special the 
field of training is. Positive correlations between intelligence test 
scores and improvement in ability to read “were obtained by measuring 
the improvement in terms of the average number of words read during 
the first three days and the last three days of practice” (p. 451). As 
he unfortunately does not publish the individual scores we can not 
make the specific corrections desirable in his correlations. His curves 
show that the difference between the mean scores of the brighter and 
the duller groups, as determined by the tests, increases with practice. 
This he takes as evidence that the brighter subjects improve more 
rapidly; but it is of course obvious that if instead of using the amount 
of work done in 10 minutes as his measure he had taken the time required 
to do a given amount of work, his results would have been just reversed. 
Plotted in this manner the curves of the means of the two groups would 
have approached each other as practice increased. This is clearly 
seen in Fig. 1, where we have reproduced Johnson’s amount of work, 
or average attainment curves, a and a’, representing the upper and the 
lower group, respectively; and where we have also added the time 
curves, t and t’, which represent the average number of minutes 
required to read 1000 words by the groups in the order given. The 
upper group consisted of the 30 subjects scoring above the average of 
the entire 60 individuals, and the lower group consisted of the 30 scoring 
below this average. The data for these curves have been taken from 
Johnson’s curves, as he gives no tables of his data, but measurements 
were made with the utmost care. The data which we thus obtained 
are given in Table I. Any slight deviation between these data, thus 
obtained, and those of Johnson’s experiment if he has preserved them, 
will make no difference in the conclusions and results here offered. 


TABLE I. SUCCESSIVE MEAN SCORES IN MIRROR READING AND THE RATIOS OF 
EACH MEAN TO THE ONE IMMEDIATELY PRECEDING IT 

                                        TRIALS 
                     1     2     3     4     5     6     7     8     9    10 
Average number of words per 10 minutes 
Upper group.....   223   365   442   484   527   587   597   652   690   740 
Lower group.....   130   246   292   346   364   424   437   460   488   522 
Average number of minutes to read 1000 words 
Upper group.....  44.8  27.4  22.6  20.7  19.0  17.0  16.7  15.5  14.5  13.5 
Lower group.....  76.9  40.7  34.2  28.9  27.5  23.6  22.9  21.7  20.5  19.2 
Ratios of “average number of words” scores 
Upper group.....  ....  1.64  1.21  1.10  1.09  1.11  1.08  1.09  1.06  1.07 
Lower group.....  ....  1.89  1.19  1.18  1.05  1.16  1.06  1.05  1.06  1.07 
Ratios of “average number of minutes” scores 
Upper group.....  ....   .61   .82   .91   .92   .90   .98   .92   .94   .93 
Lower group.....  ....   .53   .84   .85   .95   .86   .94   .95   .94   .93 

                                        TRIALS 
                    11    12    13    14    15    16    17    18    19    20 
Average number of words per 10 minutes 
Upper group.....   772   824   874   883   925   944   998  1042  1068  1184 
Lower group.....   540   593   622   637   657   668   685   747   785   797 
Average number of minutes to read 1000 words 
Upper group.....  13.0  12.1  11.4  11.3  10.8  10.6  10.0   9.6   9.4   8.4 
Lower group.....  18.5  16.9  16.1  15.7  15.2  15.0  14.6  13.4  12.7  12.5 
Ratios of “average number of words” scores 
Upper group.....  1.04  1.07  1.06  1.01  1.05  1.04  1.06  1.04  1.02  1.11 
Lower group.....  1.03  1.10  1.05  1.02  1.03  1.02  1.02  1.09  1.05  1.01 
Ratios of “average number of minutes” scores 
Upper group.....   .96   .93   .94   .99   .95   .96   .94   .96   .98   .90 
Lower group.....   .97   .91   .95   .98   .97   .98   .98   .92   .95   .99 


It is clear that the time curves taken on their face value, as Johnson 
has taken the average attainment curves, would indicate just the oppo- 
site conclusion from that which he drew. Which kind of data then 
shall we use? Obviously neither, in the light of this contradiction, if 
we are to be safe in our conclusions. 
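
The contradiction follows from the reciprocal relation between the two measures and can be checked with a short sketch. The first and last mean scores are taken from Table I; the function name is hypothetical.

```python
def minutes_per_1000_words(words_per_10_min):
    """Convert an amount score (words in 10 minutes) to a time score."""
    return 10.0 * 1000.0 / words_per_10_min


# First and twentieth mean scores from Table I, in words per 10 minutes.
upper = {"first": 223, "last": 1184}
lower = {"first": 130, "last": 797}

gaps = {}
for trial in ("first", "last"):
    gap_words = upper[trial] - lower[trial]
    gap_minutes = (minutes_per_1000_words(lower[trial])
                   - minutes_per_1000_words(upper[trial]))
    gaps[trial] = (gap_words, gap_minutes)
    print(f"{trial} trial: gap of {gap_words} words, {gap_minutes:.1f} minutes")
```

The same two pairs of means give a gap that widens when expressed in words and narrows when expressed in minutes, so neither absolute difference by itself shows which group improved more.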

Taking Johnson’s own curves, a and a’, we should certainly find it 
much fairer to determine the superiority of the better group average 
over the poorer one in terms of percentage, and then to compare the 
final with the initial difference in this respect. By this method we find 
that the superior group starts with a mean number of words per 10 
minutes that is 49 per cent greater than the mean of the poorer group, 
when the first three trials are considered, but it has a mean superiority 
in the average of the last three trial scores (the eighteenth, nineteenth, 
and twentieth) of only 41 per cent. It would seem fair to 
measure improvement in this manner, rather than in absolute gains, 
since improvement means gain over something as a starting point. 
And this method involves us in no such inconsistency with the time 
curves, or the means of time scores, as does the method used by 
Johnson. These figures, then, contradict Johnson’s conclusion that 
the upper group improved more than the lower group. So also we 
find that the increase in the upper group curve is 415 per cent while 
that of the lower group, curve a’, is 514, the increase being the gain over 
the point of beginning, or over the first mean score, in the case of each 
group. If, to get more stable measures for comparison, we take the 
average of the first three and of the last three trials, we get respective 
increases of the upper and the lower groups of 209 and 249 per cent. 
This method also shows greater improvement of the lower group. 
Moreover, the harmonic mean of the increase of each trial, from the 
second to the twentieth, over its preceding trial in terms of mean 
number of words per 10 minutes—that is, the harmonic mean of 19 
increments—is 32.8 words for the upper group and 23.6 for the lower 
group. These means are increases over the respective starting average 
scores of 14.7 per cent for the upper group and of 18.2 per cent for the 
lower group of subjects.¹ All these data show that the lower group, 
the 30 subjects scoring below average in the intelligence tests, made 
more improvement in the learning than did the upper group of 30 
subjects. 

The geometric mean of the percentage increments or decrements 
shown in Table I, that is the nineteenth root of the product of the 19 
percentages, would seem to be a fair measure of the improvement,
inasmuch as it is the mean percentage increase of each stage of learning
over the preceding stage. In average number of words per 10 minutes 
the geometric mean of the improvement of the upper group is 9.18 
per cent as against a mean improvement of 10.01 per cent for the lower 
group. The corresponding geometric means of the decrease in average 
number of minutes to read 1000 words are 8.4 per cent and 9.1 per cent.
These means also show greater gain for the lower group, contrary to 
Johnson’s conclusions. 
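The geometric mean of successive ratios described above can be sketched in Python. This is only an illustration: the function names and the short four-trial series are hypothetical and are not the data of Table I. Because the successive ratios telescope, their geometric mean depends only on the first and the last mean score, so the total increase of 514 per cent quoted for the lower group implies, on this reading, a mean gain per stage of roughly 10 per cent over 19 stages, in line with the figure given in the text.

```python
import math


def geometric_mean_ratio(scores):
    """Geometric mean of each score's ratio to the preceding score."""
    ratios = [later / earlier for earlier, later in zip(scores, scores[1:])]
    log_sum = sum(math.log(r) for r in ratios)
    return math.exp(log_sum / len(ratios))


def rate_from_total_gain(percent_gain, n_steps):
    """Per-step ratio implied by a total percentage gain over n_steps steps."""
    return (1 + percent_gain / 100) ** (1 / n_steps)


# Hypothetical mean scores for four trials (illustration only).
hypothetical = [100.0, 130.0, 150.0, 200.0]
gm = geometric_mean_ratio(hypothetical)

# The ratios telescope: the result equals (last / first) ** (1 / steps).
telescoped = (hypothetical[-1] / hypothetical[0]) ** (1 / 3)

# Lower group: a 514 per cent increase spread over 19 increments.
lower_rate = rate_from_total_gain(514, 19)
lower_percent = (lower_rate - 1) * 100
print(f"mean gain per stage: {lower_percent:.2f} per cent")
```

The same function applied to the minutes-per-1000-words series would yield ratios below 1, that is, mean percentage decrements.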

It is instructive to examine the actual ratios from which these 
geometric means have been found. These ratios are given in Table I. 
These data show more rapid improvement by the lower than by the 
upper group in the early part of the learning. Another way² to bring
out this same fact of more rapid initial gain by the group scoring below
average in the intelligence test is to determine which group reaches first
the geometric mid-point of its own entire improvement, or of the
improvement indicated by the final mean score as compared with the
initial mean score. This mid-point is obtained by taking the square
root of the product of the first and last mean of each group. We thus
obtain a mid-point of improvement score for the upper group of 513.84
words in 10 minutes, and for the lower group of 321.88 words in 10
minutes. Now, while there are technical objections
to taking these as the actual mid-points of improvement—a


1 The percentage of the lower group passing the median or the mean of the 
upper group would be a good measure to use, but we have not the necessary data 
for this method. If the upper group is learning faster than the lower one, a smaller 
percentage of the latter group would pass the average of the upper group toward the 
end than at the beginning of the experiment. 

2 For this suggestion and for other aid in the preparation of this paper I am 
indebted to my son, J. Kimbark Peterson. 
position that would commit us to the view that the learning actually
goes on much more rapidly in the early part of the process than in 
the latter part—the scores thus obtained at least represent comparable 
stages in the process. An examination of Table I shows that the lower 
group reached their “mid-point,” say 322, at almost exactly the begin- 
ning of the fourth day of practice, whereas the upper group reached the 
corresponding point in their progress, say 514, at about .2 of the way 
beyond the beginning of the fifth practice period. Thus the lower 
group shows signs of having been less practiced at the beginning of 
the experiment on the mental function or functions to be trained in it, 
than was the upper group. Is it not possible also that this function
or these functions are represented in the intelligence tests to the
extent that the difference in the scores of these two groups may have
been in large part due to the greater practice of the so-called superior
group? This is only a speculation, but it finds some support in the
fact of greater initial improvement of the lower group. It raises a 
very interesting and important problem in intelligence testing and 
suggests a method of approach to its solution. We are undoubtedly 
taking too little account in tests of the respective amount of practice 
in the functions involved in them by the different subjects. 
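The mid-point comparison can be illustrated with a short Python sketch. The trial means below are hypothetical and chosen only to make the arithmetic easy to follow; they are not the values of Table I, and the linear interpolation between adjacent trials is an assumption about how a position such as "0.2 of the way beyond the beginning of the fifth practice period" might be located.

```python
import math


def geometric_midpoint(first_mean, last_mean):
    """Square root of the product of the first and last mean scores."""
    return math.sqrt(first_mean * last_mean)


def trial_position_reaching(scores, target):
    """1-based (fractional) trial position at which a rising curve first
    reaches target, interpolating linearly between adjacent trials.
    Returns None if the target is never reached."""
    if scores and scores[0] >= target:
        return 1.0
    for i in range(1, len(scores)):
        before, after = scores[i - 1], scores[i]
        if before < target <= after:
            return i + (target - before) / (after - before)
    return None


# Hypothetical mean words per 10 minutes on five successive trials.
lower_group = [100.0, 200.0, 280.0, 330.0, 400.0]

mid = geometric_midpoint(lower_group[0], lower_group[-1])
position = trial_position_reaching(lower_group, mid)
print(mid, position)
```

A group that reaches its own mid-point at an earlier trial position is, on this measure, gaining more rapidly in the early part of the learning.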

It is not surprising that the subjects scoring lower in the initial 
period of the learning should show greater improvement than the 
group scoring higher. It is known that practice of several subjects 
in a given function, or at least in some functions, makes them become 
more and more alike in respect to this function,¹ though it is a frequent
fallacy on the part of experimenters to misinterpret the data by 
measuring results in absolute gains in terms of amount of work done 
in a given time unit, without considering the other facts to which 
attention has been called in our present discussion. 





1 Peterson, Jos.: Thurstone’s Measures of Variability in Learning. Psycho- 
logical Bulletin, vol. XV, 1918, pp. 452-455, where other references are given. 
LANGUAGE HANDICAP AND THE STANFORD
REVISION OF THE BINET-SIMON TESTS 


FRED C. WALTERS 


Graduate Student, Teachers College, Columbia University 


The writer gave the Stanford Revision of the Binet-Simon Tests 
to 165 children in one of the New York City schools. Ninety-eight of 
these children were in Grade VIB and sixty-seven were in VIIA. The 
community in which the school was located was made up largely of 
foreign-language-speaking homes. It was thought that this foreign 
language factor would have some influence on the scores made on the
tests by these children; so certain preliminary data were secured as the 
tests were given for the purposes of estimating this influence. Each 
child tested was asked the following questions: What language is 
spoken in the home? Is any other language spoken by either of the 
parents? If so, what language? To what extent is it used when speaking
to you? Do you speak any other language than English? Do
you understand the foreign language your parent or parents speak? 
Have you ever attended any other school than the public school? 
Where there was an affirmative answer to this question, it was followed 
up by other questions to find out what school, the extent of attendance, 
and the extent, if at all, to which this attendance interfered with 
attendance at the regular public school. Fourteen children reported 
Saturday attendance at a Bohemian-language school. 

All of the children tested with the exception of two—one born in 
England and one in Canada—were born in this country; all but two or 
three had gone to an American public school for the term of their 
school-life; and all spoke fair unbroken English. The nationalities 
represented were Bohemian 47, German 43, Italian 16, Polish 5, 
French 1, Norwegian 1, Jewish 1, and American 51 (American here is 
used to designate all those of mixed extraction where English only was 
spoken by the parents). The occupations represented by the fathers 
of these children were: mechanic 14, carpenter 12, painter 8, metal 
worker 8, button factory worker 6, grocer 5, printer 5, marble worker 5, 
butcher 5, baker 5, electrician 5, automobile driver 8, salesman 5, 
brick layer 4, construction foreman 4, tailor 4, clerical worker 4, shoe 
factory worker 4, members of fire department 3, policeman 3, motor- 
man 2, expressman 2, plasterer 3, common laborer 14, and one each of 
a dozen other trades or occupations. 
As a means for determining the influence of foreign language usage
on the scores made on the tests, the test booklets were divided into two 
groups on the basis of the language predominantly spoken in the home. 
That is, if the home language was quite generally English, the booklets 
were put into one pile; if, however, a considerable amount of the home 
language was a foreign language, the booklets were put into a second 
pile. These two piles or groupings will hereafter be spoken of as 
Group A (the English-speaking-home group) and Group B (the foreign- 
language-speaking-home group). For the purposes of still further
refinement of the study, all booklets were rejected from Group A
where any language other than English was spoken by the parents,
whether this language was reported to have been used in the home or
not. And all booklets were rejected from Group B where the
home language was not predominantly foreign. These two refined groups
will hereafter be spoken of as Group C and Group D, respectively.
Groups A and B include all of the 165 children tested and represent a 
rough division of these children into those from English-speaking 
homes and those from foreign-language-speaking homes. Groups C 
and D are those children of groups A and B, respectively, where there 
was no question about the language spoken in the home being either 
all English or very largely foreign. Groups C and D should be
thought of as refinements of groups A and B. 


THE COMPOSITION OF THE GROUPS


There was no marked difference on the basis of occupations of the
fathers of these children between the groups. The trades and skilled
occupations were fairly proportionately distributed between groups A
and B, and C and D. The three members of the city fire department,
the three policemen, and the two motormen all came from Group A; 
while the grocers, butchers and carpenters were found largely in Group 
B. The number of skilled occupations represented by Group A was 25; 
the number represented by Group B was 27. If a trend of occupations 
might be said to exist for either of the groups, it was in the direction 
of the clerical and semi-official for Group A and in the direction of the 
trades for Group B. The number in each group, the range of chrono- 
logical age for each group, the mean chronological age, and the mean 
mental age and IQ as shown by the tests are given in Table I. The 
last two rows of Table I, headed “Group A − B” and “Group C −
D,” are differences and should be read “Group A minus B” and
“Group C minus D.”
TABLE I

Groups          Number in   Range of ages   Mean CA     Mean MA    Mean IQ
                group
Total group       165       10-4 to 15-5    12-8.4      12-2.3      96.3
Group A            67       10-4 to 15-5    12-6.6      12-9.8     102.8
Group B            98       10-8 to 15-3    12-9.6      11-9.2      91.9
Group C            51       10-4 to 15-5    12-7.3      12-8.4     101.7
Group D            76       10-8 to 15-3    12-9.5      11-10       92.3
Group A − B                                 0-(−3)      1-0.6       11.0
Group C − D                                 0-(−2.2)    0-10.4       9.4
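The age entries of Table I are written in a years-months notation, so that 12-9.8 means 12 years and 9.8 months. As a minimal sketch (the helper names are hypothetical), the difference rows can be checked by converting each entry to months before subtracting:

```python
def to_months(age):
    """Convert a 'years-months' entry such as '12-9.8' to months."""
    years, months = age.split("-")
    return int(years) * 12 + float(months)


def diff_months(first, second):
    """Difference in months between two 'years-months' entries."""
    return round(to_months(first) - to_months(second), 1)


# Mean mental ages and mean chronological ages as printed in Table I.
ma_a_minus_b = diff_months("12-9.8", "11-9.2")   # 1 year 0.6 months
ma_c_minus_d = diff_months("12-8.4", "11-10")    # 0 years 10.4 months
ca_a_minus_b = diff_months("12-6.6", "12-9.6")   # Group A is 3 months younger
ca_c_minus_d = diff_months("12-7.3", "12-9.5")   # Group C is 2.2 months younger
print(ma_a_minus_b, ma_c_minus_d, ca_a_minus_b, ca_c_minus_d)
```

The negative chronological-age differences correspond to the entries printed as 0-(−3) and 0-(−2.2).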














PROCEDURE FOLLOWED 


The general plan of procedure was the comparison of scores made on 
what might be termed non-language tests with scores made on tests 
presupposing average ability in English usage. The assumption 
underlying this comparison was that what the
children did on the non-language group of tests might be taken as a 
basis for estimating what they would do on the total group of tests if 
there were no language handicap operating. The group of tests that 
will hereafter be spoken of as the Non-language group and which was 
taken as the basis of comparison was composed of the Designs Test of 
Year X, the Ball and Field, Repeat Five Digits Backwards, and Pic- 
tures (interpretation) Tests of Year XII, the Induction Test, Arith- 
metic Reasoning, and Clock Tests of Year XIV, and the Problem of 
the Enclosed Boxes, Repeat Six Digits Backwards, and Code Tests of 
Year XVI. The tests selected because of the understanding and use of 
English which they presuppose were the Vocabulary and Naming 
60 Words Tests of Year X, Vocabulary, Abstract Words, and Fables 
Tests of Year XII, Vocabulary Test of Year XIV, and Vocabulary, 
Interpretation of Fables, and Difference between Abstract Words 
Tests of Year XVI. These tests will hereafter be spoken of as the 
Language group. 

All of the 165 children were given the tests for years X, XII, XIV, 
and XVI, excepting such children as passed all of the tests for Year 
XII, which year was usually given before giving the tests for Year X. 
Where all of the tests for Year XII were passed, full credit was given 
for Year X. Each test of each of the four years was scored as “passed”
or “failed,” and these scores were tabulated by years. The percentage
of children passing each of the tests for each of the four years was then
determined. To make this clear, take Test 1 of Year XII. One hundred
eighteen of the 165 children passed this test. Thus, the percentage
of “passed” was 71.5. The percentage passing Test 2 of Year
XII was 65.4, etc. These percentages are given in Tables II, III, IV,
and V. The last two rows of these tables are differences and show the
percentages of children in one group passing the particular tests in
excess of the percentages of children in the contrasted group passing
the same tests. For example, in the row headed “Group A − B,”


TABLE II
Year X
Percentages of children passing the tests

Name of test    Vocabu-   Absurd-   Designs   Report   Compre-   60
                lary      ities                        hension   Words
Total group      100       96.4      87.8      97       80        78.8
Group A          100       97        91       100       92.5      91
Group B          100       96        86        95       71.4      70.4
Group C          100       96        88       100       92.1      91.1
Group D          100       95        87        93.4     74        72.5
Group A − B        0        1         5         5       21.1      20.6
Group C − D        0        1         1         6.6     18.1      18.6

TABLE III
Year XII
Percentages of children passing the tests

Name of test    Vocabu-   Abstract   Ball and   Dissected   Fables   Five Digits   Pictures   Simi-
                lary      Words      Field      Sentences            Backwards                larities
Total group      71.5      65.4       58         56.5        55.8      70           60         55.2
Group A          91        83.6       68.2       71.6        71.6      64.2         64.2       74.6
Group B          56.1      51         53         44          42        71.4         54         42
Group C          90.2      82.4       58         70.6        70.6      60.8         60.8       72.6
Group D          55.2      47.4       51.3       42          27.6      72.3         52.6       39.4
Group A − B      34.9      32.6       15.2       27.6        29.6      −7.2         10.2       32.6
Group C − D      35        35          6.7       28.6        43       −11.5          8.2       33.2


























280 The Journal of Educational Psychology 





















































TABLE IV
Year XIV
Percentages of children passing the tests

Name of test    Vocabu-   Induction   President   Problems   Arithmetical   Clock
                lary      Test        and King    of Fact    Reasoning
Total group      18.4      49          20.6        45.5        40            21.8
Group A          37.3      48.5        28.4        57          40.3          22.4
Group B           7        50          16.3        38          41            21.4
Group C          37.3      40          27.4        60.8        35.3          11.8
Group D           6.6      54          14.5        41          46            22.3
Group A − B      30.3      −1.5        12.1        19          −0.7           1
Group C − D      30.7     −14.0        12.9        19.8       −10.7         −10.5

TABLE V
Year XVI
Percentages of children passing the tests

Name of test    Vocabu-   Fables   Difference,   Problem of   Repeat Six   Code
                lary               Abstract      Enclosed     Digits
                                   Words         Boxes        Backwards
Total group       2.4      12.7      5.5          30           26          11
Group A           6        21       12            34.3         28.4        20.7
Group B           0         7        1            26.5         24.5         5.2
Group C           6        21.6     11.8          33.3         25.5        19
Group D           0         9.2      1.3          30.3         28           6.6
Group A − B       6        14       11             7.8          3.9        15.5
Group C − D       6        12.4     10.5           3           −2.5        12.4























21.1 is found as the coordinate of the column headed “Comprehension.”
This 21.1 is the percentage of children in Group A passing the “Comprehension”
Test in excess of the percentage of children in Group B
passing the same test.
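The two quantities just described, the percentage passing a test and the excess of one group's percentage over another's, can be sketched as follows. The function names are hypothetical; the figures are those quoted in the text and in Table II.

```python
def percent_passing(n_passed, n_tested):
    """Percentage of children passing a test, to one decimal place."""
    return round(100 * n_passed / n_tested, 1)


def excess(percent_first_group, percent_second_group):
    """Excess of the first group's percentage over the second group's."""
    return round(percent_first_group - percent_second_group, 1)


# Test 1 of Year XII: 118 of the 165 children passed.
vocabulary_xii = percent_passing(118, 165)

# Comprehension, Year X: Group A 92.5 per cent, Group B 71.4 per cent.
comprehension_a_minus_b = excess(92.5, 71.4)
print(vocabulary_xii, comprehension_a_minus_b)
```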

For the purposes of further comparison the percentages of children 
passing the tests were averaged separately for each of the four years 
and for each of the five groups, A, B, C, D, and the total group. These 
averages were then summed and divided by four to give a total average 
percentage of tests passed by each of the groups. To illustrate, take
Total group, Year X. The percentages of children passing the tests 
are 100, 96.4, 87.8, 97, 80, and 78.8 respectively for the six tests that 
make up Year X. The average of these six percentages is 90. So, 
also, for years XII, XIV, and XVI, with average percentages of 61.5, 
32.6, and 14.6. Summing these four averages and dividing by four 
gives 49.7, the total average percentage of tests passed by this group. 
Like averages were computed for the group of tests designated the 
Non-language group, and for the group of tests designated the Lan- 
guage group. These averages are given in Table VI. These same facts 
may be expressed in terms of months of mental age and are given in 
Table VII. The numbers of Table VII represent months of mental 
age and are averages for the several groups. They were determined 
by totalling the months “passed” on the tests of years X, XII, XIV,
and XVI. This procedure may be readily understood by thinking of
the tests used in the Non-language group as “starred” tests.
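The averaging procedure for the total group can be reproduced in a few lines. The sketch below uses the total-group percentages printed in Tables II to V; the variable names are hypothetical. Each year is first averaged over its own tests, and the four yearly averages are then averaged in turn, so that each year carries equal weight regardless of how many tests it contains.

```python
# Total-group percentages of children passing each test, by year.
total_group = {
    "X": [100, 96.4, 87.8, 97, 80, 78.8],
    "XII": [71.5, 65.4, 58, 56.5, 55.8, 70, 60, 55.2],
    "XIV": [18.4, 49, 20.6, 45.5, 40, 21.8],
    "XVI": [2.4, 12.7, 5.5, 30, 26, 11],
}

# Average the percentages within each year.
year_means = {year: sum(p) / len(p) for year, p in total_group.items()}

# Average the four yearly averages to get the total average percentage.
overall = sum(year_means.values()) / len(year_means)

for year, mean in year_means.items():
    print(f"Year {year}: {mean:.2f}")
print(f"Total average percentage: {overall:.2f}")
```

The yearly means agree with the 90, 61.5, 32.6, and 14.6 quoted in the text to within rounding, and the overall figure agrees with 49.7.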

It will be seen from Tables VI and VII that for groups A and C 
there is fairly consistent performance on all three groupings of the tests, 
the performance on the Language group being in each case slightly 
higher than on the Non-language group. For groups B and D there 
is wide variation between the performance on the Language and Non- 
language groupings of the tests, and this variation holds proportion- 
ately for the total group of tests. This variation can best be expressed 
perhaps in terms of months of mental age. For Group B the average 
performance on the Non-language group of tests was 38.1 months, while 
the like performance on the Language group of tests was 24.4 months, 
or 13.7 months less. The variation for Group D was still wider, 
amounting to a difference of 17.3 months. When the performance of 
groups B and D on the Non-language group of tests is compared with 
their performance on the total group of tests, the difference is less 
marked. This is to be expected if there is a language handicap oper- 
ative, as the total group of tests includes the Non-language group. 
An estimate of the amount of language handicap operative for Group B 
on the total group of tests for the years X, XII, XIV, and XVI may be 
arrived at by taking the difference between the number of months of 
mental age passed on the total group of tests and the number of months 
of mental age passed on the Non-language group of tests. This difference
is 5.7 months. The like difference for Group D is 8.1 months.
A slightly more refined estimate of the amount of language handicap 
is the difference in performance of groups A and B, and C and D, on 
TABLE VI

Total average percentage of tests passed

                All tests of the   Non-language   Language
                four years         group          group
Total group          49.7             52.4          44.8
Group A              57.1             55.3          57.0
Group B              44.5             50.4          36.2
Group C              55.1             50.7          56.8
Group D              44.7             52.1          34.7
Group A − B          12.6              4.9          20.8
Group C − D          10.4             −1.4          22.1

TABLE VII

Total months of mental age on years X, XII, XIV, and XVI

                All tests of the   Non-language   Language
                four years         group          group
Total group          36.9             39.8          32.5
Group A              43.3             42.2          43.2
Group B              32.4             38.1          24.4
Group C              41.5             38.2          43.2
Group D              32.4             40.5          23.2
Group A − B          10.9              4.1          18.8
Group C − D           9.1             −2.3          20.0











the total group of tests minus their respective differences of perform- 
ance on the Non-language group of tests. These differences are 
expressed in Table VII by the differences between the numbers 10.9 
and 4.1, and 9.1 and —2.3, in the rows headed Group A — B and 
Group C — D, respectively. They take into account the superiority 
of performance of Groups A and C on the total group of tests over their 
performance on the Non-language group of tests. The differences
are, for Group B, 6.8 months and for Group D, 11.3 months. As some
of the children taking the tests reached back into Year IX and others
reached forward into Year XVIII, these estimates are slightly too low.
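The two estimates of language handicap can be sketched from the values printed in Table VII. The dictionary layout and function names are hypothetical. The simple estimate subtracts a group's months on all tests from its months on the Non-language tests; the refined estimate subtracts the between-group difference on the Non-language tests from the between-group difference on all tests. Computed from the table as printed, the refined estimate for Group D comes to 11.4 months where the text gives 11.3, presumably a matter of rounding.

```python
# Months of mental age from Table VII.
months = {
    "A": {"all": 43.3, "non_language": 42.2, "language": 43.2},
    "B": {"all": 32.4, "non_language": 38.1, "language": 24.4},
    "C": {"all": 41.5, "non_language": 38.2, "language": 43.2},
    "D": {"all": 32.4, "non_language": 40.5, "language": 23.2},
}


def simple_handicap(group):
    """Non-language performance minus performance on all tests."""
    g = months[group]
    return round(g["non_language"] - g["all"], 1)


def refined_handicap(english_group, foreign_group):
    """Group difference on all tests minus group difference on the
    Non-language tests."""
    e, f = months[english_group], months[foreign_group]
    diff_all = e["all"] - f["all"]
    diff_non_language = e["non_language"] - f["non_language"]
    return round(diff_all - diff_non_language, 1)


print(simple_handicap("B"), simple_handicap("D"))
print(refined_handicap("A", "B"), refined_handicap("C", "D"))
```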

The Code Test of Year XVI offered considerably more difficulty 
to Group B than to Group A. It was included in the Non-language 
group because it was thought to be representative of performance 
requiring but little language ability, and because it would add to the 
varied nature of the Non-language group of tests. Two language 
factors enter into the taking of this test and perhaps account in part 
for the difference of performance of Groups A and B. One is the fact 
that the instructions to the child for taking this test are long and 
wordy, and the other is that the correct performance of the test involves 
the ability to spell the code words “come quickly.” Each of these
factors, it is believed, operated somewhat more against Group B than
against Group A.

In giving the tests it was felt that the fables offered special difficulty 
to children from the foreign-language-speaking homes. To test this
impression, the individual scores made on the set of fables were averaged
for each of the groups. These averages follow: Total group, 3.73; Group
A, 4.87; Group B, 3.0; Group C, 4.92; and Group D, 3.1. The average number of
words correctly passed on the vocabulary at the back of the booklet 
was also computed for each of the groups. These averages were: Total 
group, 42.5; Group A, 46.5; Group B, 39.3; Group C, 46.1; and Group 
D, 39.1. The fables offered approximately 40 per cent more diffi- 
culty for Group B, relative to the performance of Group A, than did the 
vocabulary. 
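The text does not state how the figure of approximately 40 per cent was obtained. One possible reading, offered here only as an assumption, compares the ratio of Group A's average to Group B's average on the fables with the same ratio on the vocabulary; the names below are hypothetical.

```python
# Average scores quoted in the text.
fables = {"A": 4.87, "B": 3.0}
vocabulary = {"A": 46.5, "B": 39.3}


def advantage(scores):
    """Ratio of Group A's average score to Group B's average score."""
    return scores["A"] / scores["B"]


# How much larger Group A's advantage is on the fables than on the vocabulary.
relative_difficulty = advantage(fables) / advantage(vocabulary)
print(f"fables advantage:     {advantage(fables):.3f}")
print(f"vocabulary advantage: {advantage(vocabulary):.3f}")
print(f"ratio of advantages:  {relative_difficulty:.3f}")
```

On this reading the ratio of advantages lies between 1.3 and 1.45, which is of the same order as the figure in the text; other ways of forming the comparison would give somewhat different values.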

The conclusion reached by the study is that there is a language 
handicap of from six to eight months of mental age on the test for 
children 13 years old coming from foreign-language-speaking homes. 
This conclusion is supported by what one would expect of children com- 
ing from homes where the parents are of the intellectual status that the 
parents of these children evidently are. The occupations of the fathers 
of these children mark them as of good average intelligence or above. 
Further, the fact that selection has already been operative to a con- 
siderable extent by the time children reach Grade VII of the public 
school is strong argument in support of the contention that the average 
intelligence of Grade VII school children is equal to the average 
intelligence of unselected children of like age. When children with a 
language handicap are associated in school with children from
English-speaking homes, the forces of selection are likely to operate more
rigorously against the children with the language handicap than would be
the case were the children not so associated. It is the opinion of the 
writer that Group B of this study represents the more highly selected 
of the two groups. 








STANDARDIZED CONTENT IN HIGH SCHOOL 
BIOLOGY 


G. M. RUCH 
State University of Iowa 


AND 
LEO H. COSSMANN 
University of Oregon 


The present report deals with a preliminary attempt to survey the 
content of high school courses in biology throughout the United States. 
Biology presents, perhaps, the least uniformity in subject matter today 
of any of the sciences commonly included in the secondary curriculum. 
In the main this is a decidedly wholesome situation although doubtless 
some effort toward the selection of a minimum, basal content would 
have the effect of raising the standards of instruction in biology in 
the smaller schools. The urgent need is for rigid experimentation as a 
foundation for a semi-standardized content for such courses. In 
the absence of such objective data, the second best means of arriving 
at the desired end is that of surveys of what the best schools are teach- 
ing in biology. To do this properly we might have recourse to several 
methods of investigation, viz., analysis of textbooks, analysis of
courses of study, questionnaire studies, or compilations based upon final 
examinations which have actually been given over a period of years in 
leading schools of secondary rank. The advantages among these 
possible methods favor, in the main, the last mentioned since the final 
examination represents a more or less thoughtful attempt on the part 
of the teacher to isolate those facts and principles which can with most 
reason be expected to be retained by the pupil for some appreciable 
length of time after the completion of the formal work of the classroom 
and laboratory. It is for this reason that the authors chose to attempt 
to survey good teaching practice in biology by statistical analysis of 
final examinations collected from all parts of the country. The 
limitations of the method are as apparent as are its advantages and no 
claim is made that finality of content determinations can or should 
be based upon this method. 

Believing that the results of such a provisional investigation would 
possess enough merit to justify the effort expended, the authors¹





1 Credit for most of the actual work is due to Leo H. Cossmann. 
began in 1918 actual work on the problem, the completion of which
has, for many reasons, been sadly delayed. The method of attack 
will be very briefly presented in summarized form. 

1. A letter was sent to the Superintendent of Public Instruction in 
every State in the Union having a central officer asking that he list 
the names of those schools and teachers of his state who were doing the 
highest type of work in high school biology. Lists were received from 
23 states with a total of 126 schools answering to our specifications. 

2. Each of the 126 teachers was asked to send duplicate copies of 
all final examination questions used during the current year. Lists 
for previous years were also requested and in some cases were avail- 
able for periods of five or more years prior to 1918. The total number 
of questions received ran slightly over 2000. 

3. The 2000 questions were then analyzed for frequency and 
condensed and classified under convenient headings as 300 constantly 
recurring items. The group headings finally numbered 11 such phases 
of the total subject as: Definitions of terms, identification of struc- 
tures, life histories, diagrams, functions, economic and ecological facts, 
etc. 

4. The final list of 300 questions was submitted to about 100 of
the leading teachers and authorities on the teaching of biology for 
ratings as follows: 

(a) “Rate ‘1’ all questions which you consider to be entirely
satisfactory and representative of the kind of knowledge which should
be presented to high school students of biology, and which high school
students can reasonably be expected to answer.

(b) Rate ‘2’ all questions which are partially satisfactory, but
which you consider to be not highly desirable for any reason . . .
etc.

(c) Rate ‘3’ all questions which are entirely unsatisfactory.”

Provision was also made for the writing in of any other questions 
not already listed. Sixty-eight teachers and 9 authorities (authors of 
texts, professors of the teaching of science, etc.) rated all of the ques- 
tions submitted. This was a remarkably high percentage of responses 
and whatever selection did result from the failures to answer can safely 
be assumed to be a selection upwards and hence could not have oper- 
ated to decrease the reliability of the results. 

Space will not permit the tabulation of all of the data so rated. 
Only the most significant portions of the total determinations will be 
presented and the lists will be further restricted to those items rated
“1” by the 77 judges although there is doubtless a great deal of value
to be had from the analysis and tabulation of the “2” ratings. 

It is in place at this point to refer to a second motive in carrying 
out this investigation. The authors hoped in advance (and this hope 
has been realized) to use the results in the production of at least two
equal forms of a test for measuring accomplishment in high school 
biology. These tests are practically ready for publication and will 
be described in some detail in a later article.¹

Tables I to II which follow present the ratings of the two sets of 
judges together with very greatly condensed statements of the items 
rated. Two numbers occur at the right of each item. The first is the
actual number of “1” ratings given by the 68 teachers and the second
is the actual number of “1” ratings from the 9 authorities. The notation
“68:9” means, therefore, that the item was rated “1” by every
judge in both groups, and similarly for ratings of less than perfect
agreement.
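The rating notation lends itself to a simple check of agreement. The sketch below is illustrative only; the function names are hypothetical, and the proportion of all 77 judges is a convenience measure that does not appear in the original tables.

```python
N_TEACHERS = 68
N_AUTHORITIES = 9


def parse_rating(notation):
    """Split a rating such as '68:9' into (teachers, authorities)."""
    teachers, authorities = notation.split(":")
    return int(teachers), int(authorities)


def proportion_rating_one(notation):
    """Proportion of all 77 judges who rated the item '1'."""
    teachers, authorities = parse_rating(notation)
    return (teachers + authorities) / (N_TEACHERS + N_AUTHORITIES)


perfect = proportion_rating_one("68:9")   # every judge in both groups
partial = proportion_rating_one("63:4")   # 67 of the 77 judges
print(perfect, round(partial, 3))
```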


TABLE I.—ITEMS DEALING WITH DEFINITIONS, LOCATIONS, IDENTIFICATIONS,
USES, AND DIAGRAMS OF TERMS

 1. [illegible] ............ [illegible]    26. [illegible] ............. 67:8
 2. Excretion .............. 68:8           27. Protoplasm .............. [illegible]
 3. Pistil ................. [illegible]    28. [illegible] ............. 67:9
 4. Reproduction ........... [illegible]    29. [illegible] ............. [illegible]
 5. Oxidation .............. [illegible]    30. [illegible] ............. 67:9
 6. Pollination ............ 68:9           31. Crop rotation ........... 67:9
 7. Fertilization .......... 68:9           32. Carbohydrate ............ 67:9
 8. Cross-pollination ...... 68:9           33. Self-pollination ........ 66:9
 9. [illegible] ............ 68:9           34. Environment ............. 66:9
10. Biology ................ 68:9           35. Fermentation ............ 66:7
11. [illegible] ............ [illegible]    36. [illegible] ............. 66:8
12. [illegible] ............ 68:9           37. [illegible] ............. 66:9
13. Protozoa ............... [illegible]    38. [illegible] ............. [illegible]
14. Seed ................... 68:9           39. [illegible] ............. 65:9
15. Respiration ............ 68:9           40. [illegible] ............. 65:8
16. Ovary .................. [illegible]    41. [illegible] ............. 65:8
17. Chlorophyll ............ 68:9           42. Struggle for existence .. 65:8
18. Digestion .............. 68:9           43. [illegible] ............. 64:9
19. Sterilization .......... 68:9           44. Vertebrate .............. 64:9
20. Pollen ................. [illegible]    45. [illegible] ............. 63:8
21. Cotyledon .............. 68:7           46. Gametophytes ............ 63:4
22. [illegible] ............ 68:7           47. Pasteurization .......... 63:9
23. Evaporation ............ 68:8           48. Transpiration ........... 63:8
24. Sperm cell ............. [illegible]    49. [illegible] ............. 63:9
25. Survival of fittest .... [illegible]    50. [illegible] ............. 63:8


1 The Ruch-Cossmann Biology Test, World Book Co., Yonkers, N. Y. 
1 The lower ranges of all the distributions have been omitted in Tables I to
II for purposes of economy of space and because of the lesser utility of such infor-
Operculum
Parthenogenesis
Hermaphroditism
Trichinosis

TABLE II.—LIFE HISTORIES OF VARIOUS FORMS
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TaBLe III.—Comparisons oF FoRMS AND STRUCTURES 


Vertebrates and invertebrates 
Distinctions between moths and butterflies 
Differences between insects and spiders 
Comparisons of reproduction in fishes and flowering plants 
Differences between Reptilia and Amphibia. . 

Distinctions between beetles, grasshoppers, bugs, and flies. 
Comparisons of arteries and veins 
Internal structures of mono- and di-cotyledenous seeds 
Numbers of eggs laid by birds and fishes; with reasons 
Comparisons of food-getting and locomotion in Protozoa and insects... 
. Likenesses of the grasshopper and butterfly 
. Respiratory mechanisms of birds, frog, and grasshopper 
. Comparisons of frogs and fishes in appearance, means of protection, 
respiration, reproduction, locomotion, importance 
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14, 
15. 
16. 
17. 
18. 


19. 
20. 
21. 


23. 
24. 
25. 
26. 
27. 
28. 
29. 
30. 
31. 
32. 
33. 
34. 
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Comparisons of Protozoa and bacteria......................20600005 54:6 
Comparisons of saprophytes and parasites....................000000. 50:5 
Comparisons of structures of lowest animal and plant forms........... 49:3 
Larval and embryonic development...................00 0c cece eeeee 49:3 
Comparison of Paramoecium and frog for locomotion, food-getting, and 
res. aa’, lb G's haba ove ASE ws BWA Ss OOD Re 47 
Comparisons of Paramoecium and Amoeba.......................... 47 
Comparisons of legs of grasshopper, bee, and fly..................... 47 


Properties of carbon, sulphur, and phosphorus....................... 
Differences between earthworm and coelenterates.................... 
Differences between living and non-living materials 
Comparisons of plant and human nutrition. . ; 
Comparisons of circulation of the frog and the fishes. 


5.22 © © 6.6 ms O'S 215 2 AS. 


Methods of reproduction in the black mold and Spirogyra... 


Seen svs,esveoeeawese eeoewerse 
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Comparisons of Protozoa and Metazoa.......................00005. 41: 
Comparisons of respiratory system of earthworm and man............ 41: 
Comparisons of breathing of birds with other animals................ 41: 
Comparisons and diagrams of digestive systems of man, frog, and fishes. 41 

Comparisons of mouth parts of the bee and grasshopper............... 40: 
Nutrition of the molds, yeasts, and Spirogyra....................... 39: 


Digestive systems of the hydra, honey-bee, and earthworm............ 
Nutrition in Mucor with that of Spirogyra 


Etc.

TABLE IV.—DIAGRAMS AND DRAWINGS WITH FULL LABELS

1. [illegible] .... 68:9
2. Complete flower .... 65:8
3. Essential organs of flowers .... 62:8
4. Germination stages of corn [illegible] .... 57:7
5. Typical insect .... 57:6
6. Heart and lungs .... 54:5
7. [illegible] .... 52:6
8. L. S. of bean and corn (seed and fruit) .... 52:6
9. Paramoecium .... 50:8
10. [illegible] .... 49:7
11. Diagrams of Mendel’s laws .... 47:4
12. Hind leg of grasshopper and [illegible] .... 41:5
13. [illegible] .... 41:7
14. Habit of mushroom .... 39:3
15. Carbon cycle .... 36:5
16. Moss plant .... [illegible]
17. Head of butterfly .... 34:3
18. Breathing apparatus of fish .... 36:4
19. Head of honey-bee .... 26:3
20. Nuclear changes in fertilized [illegible] .... 20:0
21. Structure of skin .... [illegible]
22. Maturation of sperm .... 18:0
23. Chick embryo at 36, 48, and 72 hours .... [illegible]
24. Internal structure of crayfish .... 15:3
25. Different types of feathers .... 10:0
26. Digestive tracts of bird and [illegible] .... 10:1
Etc.

TABLE V.—ADAPTATIONS OF FORMS

1. Adaptations of human body against disease germs .... 65:9
2. [illegible] .... 65:7
3. Adaptations of flowers for attracting insects .... 63:9
4. [illegible] .... 62:7
5. Adaptations of flowers for prevention of self-pollination .... 62:8
6. Ecological adaptations of birds .... 60:7
7. Adaptations of fishes for locomotion .... 58:7
8. Adaptations of feet and beaks in birds; examples, and reasons .... 53:8
9. Mouth parts of bee and grasshopper for food-getting .... 52:7
10. Adaptations of proboscis of butterfly .... 52:7
11. Adaptations of frog for feeding, swallowing, and escape from enemies .... 51:6
12. Adaptations of hind legs of bee or grasshopper .... 49:6
13. Adaptations of clam and bird for locomotion, protection, and food-getting .... 49:6
14. Adaptations of hydra for capture and intake of food, elimination of
    waste materials, and reproduction .... 41:3
Etc.

TABLE VI.—CLASSIFICATIONS

1. Name three orders of insects; with two examples of each .... 52:4
2. Outline of great plant groups with examples of each .... 49:1
3. Names and characteristics of nine of animal groups (phyla) .... 47:3
4. Classification of sponge by phylum name .... 47:1
5. Classification of following forms: Amoeba, toad, crab, slug, seagull,
   chicken, dragon fly, Culex mosquito, locust, moth, bee, mussel,
   scorpion, robin, salmon, man, frog, snail, clam, butterfly, house-fly,
   [illegible] .... 47:5
6. Give the classification of the Arthropoda with characteristics of each .... 47:1
7. The two great divisions of the seed-bearing plants, and four divisions
   [illegible] .... 44:3
8. Classes of invertebrates with examples .... 43:1
9. Classes of mammals with examples .... 38:1
10. [illegible] .... 33:1
11. Divisions of the Gymnosperms (as detailed as possible) .... 20:0
Etc.

TABLE VII.—ECONOMIC IMPORTANCE; BENEFICIAL OR INJURIOUS

1. [illegible]
2. [illegible] .... 68:9
3. [illegible]
4. [illegible]
5. Yeasts .... 68:9
6. [illegible]
7. Potato beetle .... [illegible]
8. [illegible]
9. San Jose Scale .... [illegible]
10. [illegible]
11. [illegible]
12. Ladybird beetle .... [illegible]
13. Bumble bee .... [illegible]
14. [illegible] .... 57:8
15. Grasshoppers .... 57:8
16. Earthworms .... 52:9
17. [illegible]
18. Butterflies .... 52:8
19. [illegible] .... 50:6
20. [illegible] .... 48:4
21. [illegible] .... 46:4
22. [illegible] .... 44:6
23. [illegible] .... 44:6
24. [illegible] .... 39:3
25. [illegible] .... 35:4
26. [illegible] .... 33:2

TABLE VIII.—HISTORY OF BIOLOGY (EVOLUTION)

1. With what facts are the following names associated?
   [illegible]                    [illegible] .... 52:4
   [illegible]                    [illegible] .... 51:6
   [illegible]                    [illegible] .... 49:6
   [illegible]                    [illegible] .... 47:5
   De Vries .... 60:1             Schleiden and Schwann .... 44:3
   [illegible]                    [illegible] .... 41:4
   Etc.
2. What is the theory of evolution? Who is the originator? .... 62:9
3. What are the bearers of heredity? .... 47:6
4. What part did geology play in the development of the theory of evolution? .... 39:6
5. State four ways of accounting for the evolution of plants and animals .... 39:4
6. Trace the development of the nervous systems in the animals you have
   [illegible] .... 37:4
7. What is the Law of Recapitulation? .... 35:3
8. Why do scientists believe that birds have descended from reptiles? .... 35:4
9. What transition family remains between the invertebrates and vertebrates? .... 31:2
10. What did the Echinoderms contribute to the development of animals? .... 27:1
11. Trace the evolution of the heart from the fishes to the mammals .... 22:3

TABLE IX.—GENERAL DESCRIPTIONS, DISCUSSIONS, AND EXPLANATIONS; LARGELY
OF ECONOMIC AND HYGIENIC IMPORTANCE

1. State how the following measures aid in controlling the spread of epi-
   demics: (a) quarantine, (b) vaccination, (c) boiling drinking water,
   and (d) school nurses .... 68:9
2. Discuss importance of forests as to: (a) climate, (b) soil, (c) erosion .... 68:8
3. Discuss the following:
   Disposal of garbage .... 68:9
   Reasons for cooking food .... 68:9
   Sources of contamination of food and water .... [illegible]
   How animals carry disease .... [illegible]
   Diseases caused by bacteria .... 66:9
   Disinfectants .... 65:9
   Bacteria and agriculture .... 65:8
   Oiling of streets .... 65:8
   Tobacco and scholarship .... 65:8
   Noise and mouth breathing .... 65:9
   Use of the common drinking cup .... 65:9
   Dangers of patent medicine .... 63:8
   Preservatives .... 63:8
   Coughing and sneezing .... 62:9
   [illegible] .... 61:8
   Effects of alcohol on physical [illegible] .... 60:8
   [illegible]
   Treatment of sprains [illegible] .... 57:7
   Bacteria and dairying .... 57:7
   Prevention of spinal curvature .... 54:8
   Nitrification and denitrification .... 51:4
   Soil fertility and improvement .... 49:6
4. Explain the following:
   How the leaf does its work .... 65:7
   Methods of seed dispersal .... 63:8
   Necessity for seed dispersal .... 63:8
   Meaning of “Environment” .... 63:8
   Principles of ventilation .... [illegible]
   Inter-dependence of plants and [illegible] .... 60:7
   Development of the seed from the [illegible] .... 58:7
   Relation of Protozoa to disease .... 57:8
   Food manufacture in plants .... 57:7
   Formation of habits .... 54:6
   Inheritance of feeble-mindedness .... 54:6
   Rules of hygiene of the circulation, respiration and digestion .... 53:6
   Composition of human blood .... 52:9
   [illegible] .... 52:8
   [illegible] .... 51:7
   Why the Paramoecium is a “higher” animal than the [illegible] .... 49:5
   Most economical methods of increasing soil fertility and pro[illegible] .... 49:6
   Etc.
5. Describe the following:
   [illegible] .... 49:3
   Community life of the ants .... 49:4
   Structure of human brain .... 44:3
   One of the crustaceans .... 41:4
   One of the fishes .... 41:5
   An ant colony .... 38:4
   External and internal structure [illegible] .... 36:4
   Stages in manufacture of vinegar [illegible] .... 35:5
   Sponge in detail .... 20:3
   Etc.

TABLE X.—FUNCTIONS OF STRUCTURES AND PROCESSES

1. [illegible] .... 68:9
2. Essential organs of plants .... 68:9
3. [illegible] .... 68:6
4. [illegible] .... 68:7
5. [illegible] .... 68:8
6. [illegible] .... 68:6
7. Circulation of the blood .... 68:6
8. [illegible] .... 68:8
9. White and red blood corpuscles .... 65:8
10. Functions of protoplasm .... 62:8
11. Chloroplasts .... 62:7
12. Organs of circulation in man .... 61:7
13. Plasma, lymph, thoracic duct .... 60:8
14. Proboscis of butterfly .... 60:7
15. Digestive glands of man .... 57:8
16. Parts of bean seed .... 55:6
17. Functions common to all [illegible] .... 51:5
18. [illegible] .... 47:5
19. Hind feet of honey bee .... 43:5
20. Parts of brain .... 42:7
Etc.

TABLE XI.—UNCLASSIFIED MATERIALS

1. What should be the farmer’s attitude toward the following: Snakes,
   owls, hawks, cats, crows, wasps, poisoning of gophers, blackbirds,
   ichneumon flies, and kingbirds? .... 65:8
2. Name two foods rich in protein, grape sugar, and starch .... 65:9
3. Name important excretions of plants and animals .... 65:5
4. Name the most effective ways of killing [illegible] .... 64:8
5. Name three economical foods rich in fuel value and three foods rich in
   building materials .... 63:9
6. Tests for starch, protein, grape sugar, and fats .... [illegible]
7. Name the most common chemical elements in foods .... [illegible]
8. Name the elements used by green plants in food manufacture .... 61:8
9. Name the chief nutrients in a bowl of bread and milk .... 60:9
10. State the characteristics of each of the following and state uses to living
    forms: Carbon, Oxygen, Nitrogen .... 60:8
11. What are five essential conditions for plant growth? .... 57:6
12. Fill out the following table with reference to disease: .... 57:8

    DISEASE        ORGANISM        EXACT MODE       BEST PREVENTIVE
                   CONCERNED       OF INFECTION     MEASURES
    [illegible]
    [illegible]
    [illegible]
    [illegible]

13. Name and give the characteristics of four chemical elements .... 55:8
14. Give six methods of propagation in plants .... [illegible]
15. Name five nutrients and their uses to the human body .... 54:6
16. How is the heart affected by violent exercise? .... 53:6
17. Name three ways in which the heart is protected by the structure of
    [illegible] .... 53:6
18. What two kinds of muscles are found in the human body? .... 54:6
19. What is the chief use of carbon in the human body? .... [illegible]
20. State number, position, and kinds of teeth in man .... 51:8
21. Does alcohol tend to increase feeble-mindedness? .... 51:8
22. What are the proper proportions for a standard daily diet? .... 50:8
23. Name three processes of fermentation, stating the organism responsible,
    and the substances acted upon .... 48:3
24. Name five tissues of the human body and characterize each .... 44:4
25. Give in tabular form: (a) the digestive juices, (b) the enzymes found
    in each, and (c) the action of each enzyme .... 43:5
26. State whether the following are physical or chemical changes:
       Action of diastase on starch
       Dissolving of sugar
       Osmosis of sugar
       Oxidation of sugar
       Desiccation of spores .... 39:5
27. What are the specific effects of muscular exercise? .... 34:4
28. Outline the divisions of the human skeleton and name the bones in each
    [illegible] .... 28:3
29. Name the principal tissues formed from the middle layer of the gastrula .... 26:1
Etc.

Attention should be specifically drawn to the fact that in each table
which has been presented the lower ranges have been eliminated. No
systematic rule was observed in this arbitrary elimination of the lowest
ratings. In general, the printed tables cover from 50 per cent to 75
per cent of the total range, the average being not far from 70 per cent.
Certain of the groups of items have been much more drastically
curtailed than others upon a basis of rough inspection of relative merit.
The only justification offered for this procedure is the limitation of
available space.

Although inspection of the tabulations raises many very significant 
questions, only a very few can be discussed here. In Table VI, for 
example, there is a very evident tendency for the group of professional 
experts to minimize the importance of the taxonomic relations. But 
one of the questions dealing with classification is endorsed by more 
than 50 per cent of the professional judges in comparison with the fact 
that 9 of the 11 of these items are selected by more than 50 per cent of 
the teacher group. This finding is quite in harmony with the litera- 
ture on the teaching of biology during recent years and is probably 
significant evidence that systematic problems are gradually being 
eliminated from our courses in biology. Similar tendencies against the 
teaching of biological technicalities among the professional students 
of the problem can be found in other tables, e.g., Table IV. In general 
the professional group can be shown to favor economic and practical 
problems in high school science. 

A final question which must be answered statistically is the matter 
of the general reliability of the judgments obtained. The reliability 
coefficients appearing in column (a) of Table XII which follows were 
obtained by correlating the numbers of “1” judgments given by the
professional judges with those of the teachers; the numbers being 68
and 9, respectively. Column (b) gives the probable error of the
coefficients (Pearson Product-moment formula). Column (c) gives
the reliabilities for the sums of judgments in both groups as estimated
by Brown’s formula:


$$r_{(68+9)(68'+9')} = \frac{mr}{1 + (m-1)r}$$
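As a rough check on column (c) of Table XII, Brown's formula can be evaluated numerically. The sketch below assumes the general form r' = mr / (1 + (m - 1)r). As a further assumption not stated in the text, it takes m = 77/68, that is, the combined 68 + 9 judges relative to the 68 professional judges; that value is used only because it reproduces the printed figures. The name `brown_formula` is illustrative.

```python
def brown_formula(r, m):
    """Reliability expected when the number of judgments is multiplied by m."""
    return m * r / (1.0 + (m - 1.0) * r)


# Assumed lengthening factor (an inference from the printed values, not from the text).
M = (68 + 9) / 68

# Column (a) of Table XII, as printed.
column_a = [0.80, 0.75, 0.35, 0.91, 0.72, 0.58, 0.79, 0.74, 0.83, 0.54, 0.58]

# Estimated column (c), rounded to two places as in the table.
column_c = [round(brown_formula(r, M), 2) for r in column_a]
print(column_c)
```

Under these assumptions the computed values agree with column (c) as printed, which suggests that the combined judgments were treated as only slightly "longer" than the professional judgments alone.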





The interpretation of these reliability coefficients is very difficult 
since they are computed from the actual judgments recorded in the 
tables given above rather than from the complete data which included 
the lowest 25 per cent to 50 per cent of the range as well. This serious 
curtailment of the range and hence of the standard deviations will of 
necessity cause the reported coefficients to fall markedly short of the 
true values based upon the full range. As would be expected the 
reliability coefficients are lowest in those tables which are most
abbreviated in the lower ranges. We can conclude without much danger of
error that all of the reliabilities are moderately or very high,
particularly for the data in Tables I, II, IV, VII, and IX. Even in the cases
of Tables III and X which appear to be very low, it will be noticed that 
hardly 50 per cent of the total range was reported and used in the com- 
putations. Upon the assumption that the true sigma for the whole 
data is nearly twice the size of the sigma for the reported data in any 
of the distributions, it can be prophesied from Kelley’s formula

$$\frac{\sigma}{\Sigma} = \frac{\sqrt{r(1-R)}}{\sqrt{R(1-r)}}$$

that, upon the assumption that $\sigma/\Sigma = \tfrac{1}{2}$, the prophesied R’s will, with
one exception, all exceed 0.80. The validity of this assumption
concerning the true range can be roughly tested by inspection of the 
tabulated results. The matter of reliability can properly be dismissed 
with the statement that the obtained values are about as high as are 
ordinarily secured by the method of estimates even without the correc- 
tions attempted, and that if the described corrections are applied as 
hypothecated, the reliabilities are unusually high in most cases. 
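The correction described above can be illustrated numerically. The sketch assumes that the damaged formula reads σ/Σ = √(r(1 - R)) / √(R(1 - r)), where r is the reliability in the curtailed range and R the reliability in the full range. Solving for R gives R = r / (r + k(1 - r)), with k = (σ/Σ)². The function name and the use of column (a) of Table XII as input are choices made here for illustration only.

```python
def full_range_reliability(r, sigma_ratio):
    """Solve sigma/Sigma = sqrt(r(1-R)) / sqrt(R(1-r)) for R (assumed reading of the formula)."""
    k = sigma_ratio ** 2
    return r / (r + k * (1.0 - r))


# Column (a) of Table XII, keyed by table number.
observed = {
    "I": 0.80, "II": 0.75, "III": 0.35, "IV": 0.91, "V": 0.72, "VI": 0.58,
    "VII": 0.79, "VIII": 0.74, "IX": 0.83, "X": 0.54, "XI": 0.58,
}

# Prophesied full-range values under the stated assumption sigma/Sigma = 1/2.
predicted = {t: round(full_range_reliability(r, 0.5), 2) for t, r in observed.items()}

# Tables whose prophesied value does not exceed 0.80.
not_above = [t for t, big_r in predicted.items() if big_r <= 0.80]
print(predicted)
print(not_above)
```

On this reading only Table III fails to exceed 0.80, which agrees with the statement that all the prophesied values but one do so.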


TABLE XII.—RELIABILITY OF THE JUDGMENTS

                                          (c)
  Table       (a)       (b)       r for combined judgments
  number       r        PE        estimated by Brown’s formula

     1        .80       .02                .82
     2        .75       .08                .77
     3        .35       .10                .38
     4        .91       .02                .92
     5        .72       .09                .74
     6        .58       .13                .61
     7        .79       .05                .81
     8        .74       .07                .76
     9        .83       .03                .85
    10        .54       .11                .57
    11        .58       .08                .61




















MENTAL AGE AND SCHOOL ATTAINMENT OF 1007 
RETARDED CHILDREN IN MASSACHUSETTS 


PREPARED UNDER THE DIRECTION OF AMY HEWES 
BY MILDRED HOLT, SOPHIE MERANSKI, JULIA SNELL 


Mt. Holyoke College 


The educational waste caused by having children of retarded 
mental development in the regularly graded classes has long been 
apparent. Both the slow children and those who learn rapidly suffer 
unless the course of study is adapted to their capacity. In 1919 the 
Massachusetts Legislature enacted a statute which requires that “the
school committee of each city and town in which there are 10 or more
children three years or more retarded shall establish special classes to
give such children instruction adapted to their mental attainments.”¹

To carry out the provisions of the act, traveling clinics have been 
established for the examination of every retarded pupil in the State. 
These examinations are comprehensive and include a number of 
different fields of inquiry such as physical condition, family history, 
economic and social history, moral reactions, and practical knowledge, 
in addition to school attainment and mental age, which are the two 
findings especially dealt with here. The Stanford Revision of the 
Binet-Simon Tests was used to determine mental age, and especially 
prepared school tests to establish grade standing in the different 
school subjects. Experience in administering the Massachusetts law 
has emphasized the fact that the diagnostic significance of the compre- 
hensive inquiry is very great, especially when used as a basis for 
advice and recommendations for individual children. A somewhat 
separate problem is that of ascertaining the grade of school work which 
the child is able to perform. In general the retarded children were
“found in school grades two or three or more years above the grades
whose work they were able to do.”² An important question for the
future school program, then, is: What correlation exists between the
mental age level ascertained and the degree of educational proficiency
as shown by the grade reached in the separate examinations in the
different educational subjects? This is an inquiry which may have
implications as well regarding the general appropriateness of different 





1 Massachusetts General Acts, Chapter 277, Approved July 1, 1919, Sec. 2.
2 Fernald, W. E.: The Inauguration of a State-wide Public Mental Clinic in
Massachusetts. Mental Hygiene, Vol. 6, July, 1922, p. 3.

subjects in the curriculum and possibly also the effectiveness of the
pedagogical methods employed in each. 

The Material Used.—The records on which the study is based were 
made from examinations of 1007 retarded children in 1922, and include 
the determination of mental age, intelligence quotient, and the pro- 
ficiency attained in each of the following elementary subjects: Read- 
ing, arithmetic, spelling, writing, geography, and language. They 
were taken from records made in cities in eastern Massachusetts and 
were drawn off from each city in the order in which the children 
happened to be examined. The records were plotted on charts to 
make easy comparison with the corresponding school performance of 
the normal child. 

In these charts the mental ages and grades which indicate the usual 
performance of those ages are arranged in adjacent parallel columns. 
Thus the examiners assume that a child who has a mental age of six 
can ordinarily do Grade I work, one with a mental age of seven, Grade 
II work, so that in most cases the school performance in the separate 
subjects is indicated to the right in a more or less straight line roughly 
corresponding to the appropriate grade. The records were tabulated 
to show the distribution of children according to mental age and school 
attainment in each subject, and the Pearsonian Coefficient of Correla- 
tion was computed. 
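The charting rule and the correlation just described can be sketched as follows. The rule that mental age six corresponds to Grade I and seven to Grade II is taken from the text; extending it to other ages as "mental age minus five" is an assumption. The six pairs of scores are invented solely to show the computation and are not drawn from the records.

```python
from math import sqrt


def expected_grade(mental_age_years):
    """Grade whose work a child of the given mental age is assumed able to do."""
    return mental_age_years - 5


def pearson_r(xs, ys):
    """Pearsonian (product-moment) coefficient of correlation."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical records: mental age in years and grade attained in one subject.
mental_ages = [6, 7, 7, 8, 9, 10]
reading_grades = [1, 1, 2, 3, 3, 5]

r = pearson_r(mental_ages, reading_grades)
print(round(r, 2))
```

A child whose attained grade equals `expected_grade(mental_age)` falls on the straight line of the chart; departures from that line in either direction lower the coefficient.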


TABLE I.—DISTRIBUTION OF 1007 RETARDED CHILDREN BY MENTAL AGE

Mental age in years            Total      Girls      Boys
Total ....................      1007        357       650
4 and less than 5 ........         2                    2
5 and less than 6 ........   [illegible]
6 and less than 7 ........       158         65        93
7 and less than 8 ........       274        110       164
8 and less than 9 ........       331        106       225
9 and less than 10 .......       166         55       111
10 and less than 11 ......   [illegible] [illegible]   40
11 and less than 12 ......   [illegible]      4        12
12 and less than 13 ......   [illegible]
13 and less than 14 ......   [illegible]
14 and less than 15 ......   [illegible]

The Group Studied.—The group was composed of 357 girls and 650
boys. The chronological age varied from seven to seventeen years
inclusive, and about half of the children were under thirteen years of
age. The mental ages varied from four to fourteen years, and less
than half (44.0 per cent) of the retarded children had a mental age of
less than eight years (see Table I).

More than two-thirds (67.3 per cent) had an intelligence quotient
of less than 70 and would ordinarily be classified “feeble-minded.”
The remaining third (32.7 per cent) included the difficult borderline
cases and four pupils with intelligence quotients of 100 and over whose
retardation was clearly due to causes other than mental incapacity
(see Table II).


TABLE II.—DISTRIBUTION OF 1007 RETARDED CHILDREN BY INTELLIGENCE QUOTIENT

Intelligence quotient          Number of children
Total ....................           1007
[illegible] ..............              4
[illegible] ..............             43
[illegible] ..............            221
[illegible] ..............            410
[illegible] ..............            250
[illegible] ..............             59
[illegible] ..............             16
100 and over .............              4
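The percentages quoted in the text can be checked against the counts of Table II. The sketch assumes that the rows run from the lowest to the highest quotient, so that the first four rows are the classes below 70; the row labels themselves are not legible, and this ordering is inferred only from its agreement with the stated 67.3 per cent.

```python
# Counts from Table II in the order printed.
counts = [4, 43, 221, 410, 250, 59, 16, 4]

total = sum(counts)

# Assumption: the first four rows are the classes with quotients below 70.
below_70 = sum(counts[:4])

pct_below_70 = round(100 * below_70 / total, 1)
pct_remaining = round(100 * (total - below_70) / total, 1)

print(total, pct_below_70, pct_remaining)
```

The counts sum to 1007, and the two percentages agree with the 67.3 and 32.7 per cent given in the text.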





The Significance of the Correlation.—The coefficients of correlation 
given below show the extent to which school performance in the various 
subjects corresponds to the mental age of the boys and girls tested. 
The correlation is high for all subjects, but highest of all for language,
arithmetic, and writing, and lowest for geography.


CORRELATION:
MENTAL AGE AND
ATTAINMENT IN

The explanation of the differences in the degree of correspondence is
not hard to find. The subjects in which the correlation is highest are 
those subjects which are employed or paralleled in the mental tests them- 
selves. Language, for which the coefficient is highest, is, of course, the 
mechanism by which the child whose mental age is being determined 
expresses his capacity. In giving the definitions or solutions of prob- 
lems required in the mental tests his rating rests in large part on his 
ability to express himself. Arithmetic, the second in rank, is itself used 
in the mental tests for older children, and also requires a close parallel 
to the reasoning processes involved in the answers to some of the 
questions. Writing, for which the correlation is also high, corresponds 
to still a third aspect of the mental tests; it requires very nearly the 
same motor coördinations as those involved in the copying problems
used in the tests. 

The lower correlation for geography is equally inevitable when the 
nature of the mental tests is recalled. These tests involve almost 
nothing of the pure memory work which geography is likely to be. 
That is, in this case the subject is not utilized in the very tests by which 
mental age was originally determined. Reading and spelling, which 
stand nearer the middle of the scale of correspondence, are less easy to 
classify. They are partially represented in the tests, but their use is 
not so frequently paralleled. 

The closeness of correspondence between school performance and 
the mental age of defective children is, therefore, fore-ordained, in 
those school subjects which are themselves represented in the mental 
tests used. Miss Mary Moran, head-teacher in the Massachusetts
School for the Feeble-minded, writes: “In the majority of cases we
can almost predict from the school tests what the mental age will be.”¹
The danger of using this type of evidence as the basis of conclusions on 
the suitability or efficacy of any one type of mental tests is therefore 
apparent. The findings of the study on this point raise the same 
questions as are to be found in the criticism of psychologists who have 
discussed the determination of native intelligence by the tests. 

The performances of retarded children show a considerable degree 
of variation among the school subjects. The extremes are spelling, 
in which a majority (60.7 per cent) failed to make the grade correspond- 
ing to their mental ages; and writing, in which more than one-half 





1 Moran, Mary F.: “School Tests for School Grading and Measures of
Intelligence.” Pamphlet, n. d.

(53.9 per cent) were above the grade to which their mental ages entitled
them (see Table III).


TABLE III.—PERCENTAGE DISTRIBUTION OF RETARDED CHILDREN IN RELATION
TO GRADE CORRESPONDING TO MENTAL AGE

Children in relation to grade corresponding to mental age
Per cent below        Per cent within        Per cent above

The analysis of the records for the 1007 retarded children leaves
the question open as to whether the high correlation of mental age can 
have any particular significance as long as the two sets of tests have 
common ground. Until the psychologists devise a test of intelligence 
which does not depend mainly on school performance their assistance 
for school grading is of little value. 








THE SIGNIFICANCE OF WEIGHTED SCORES 
PAUL V. WEST 


University of Chattanooga 


After a great deal of emphasis on the necessity of weighting of 
specific items or problems in examinations and tests, doubt is beginning
to be manifested regarding the value of such emphasis. Douglass
and Spencer¹ present a report of statistical treatment of results
of certain standard tests, i.e., Douglass Algebra, Henmon Latin, Gregory
Language, and Monroe Silent Reading, and although no definite conclu- 
sion is stated, all the data presented are consistent in showing weight- 
ing to be of negligible value, with various groups ranging in size from 11 
to 32 pupils in the Junior High School. Charter’s contention that the 
correlation of weighted and unweighted scores is over .90 is found to 
be sustained. 

The problem here noted arose in connection with the construction 
of a test of ability to reproduce the thought in passages read aloud by 
the examiner. Two tests of a similar nature were used, each consisting 
of five parts. In Test A the points of possible credit were: Part I, 
11; Part II, 14; Part III, 20; Part IV, 11; Part V, 18; or a total of 74. 
In Test B the credit points were: Part I, 11; Part II, 19; Part III, 24;
Part IV, 25; Part V, 21; or a total of 100. The two tests were given on 
different days to a group of 45 High School students of mixed grades 
from 8 to 12. 

The question then arose as to whether the score on each test should 
be given as the aggregate of the simple points gotten on the test or 
whether each point, or item, should be weighted as to difficulty. 
Both methods were followed and results compared. 

The process of weighting used was simply (1) Tabulation of 
results so as to show which items were passed satisfactorily by each 
member of the class. (2) Computation of the percentage of failure of 
the subjects on each item. (3) Translation of the values thus found 
into terms of ‘percentile’ value by the use of the tabular form, p. 
392 of Rugg “Statistical Methods Applied to Education.” (4) Addi- 
tion of these values, and computation of the percentage ratio that 
each item value held to the total, so that the sum of all these values 
would be equal to 100. 
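The four steps above can be sketched in code. This is an illustration only: the conversion table from Rugg is not reproduced in this article, so step 3 substitutes a normal-curve transform (an assumption), and the function name `item_weights`, the clamping bounds, and the offset of 3 are hypothetical choices made so that the sketch runs.

```python
from statistics import NormalDist


def item_weights(passed, total=100.0):
    """Derive item weights from a pupils-by-items table of 1 (pass) / 0 (fail).

    Step 3 is an assumed stand-in for the table lookup described in the text.
    """
    n_pupils = len(passed)
    n_items = len(passed[0])

    # Step 2: proportion of pupils failing each item.
    fail = [sum(1 - row[j] for row in passed) / n_pupils for j in range(n_items)]

    # Step 3 (assumption): turn each failure rate into a difficulty value with
    # the inverse normal curve, clamped so 0% and 100% stay finite, and
    # shifted by 3 so that every value is positive.
    nd = NormalDist()
    difficulty = [nd.inv_cdf(min(max(f, 0.01), 0.99)) + 3.0 for f in fail]

    # Step 4: rescale so that the weights of all items sum to `total`.
    scale = total / sum(difficulty)
    return [d * scale for d in difficulty]


# Step 1: tabulated results for four hypothetical pupils on three items.
table = [
    [1, 1, 0],
    [1, 0, 0],
    [1, 1, 1],
    [1, 1, 0],
]
weights = item_weights(table)
print([round(w, 1) for w in weights])
```

Under these assumptions an item failed by more pupils always receives a larger weight, which is the property the procedure relies on.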


1 Is It Necessary to Weight Exercises in Standard Tests? Journal of Educational
Psychology, Vol. XIV, No. 2, Feb., 1923, pp. 109-112.

Each pupil’s score was now found as follows: Pupil (a) passed
successfully points 1, 2, 5, 8, 9, 11 in Part I of Test A giving an 
unweighted score of 6. But the weights found on these items were 
respectively .6, .6, .9, .8, .9, 1.4 giving a weighted score of 5.2. For the 
sake of convenience in interpretation all scores for each part as well as 
for the total were reduced to percentage values of the total possible 
score. For example: Subject (a) made a raw score of 6 points on Part 
I out of a possible 11 points giving him a percent score of 55. When 
the points were weighted he made a score of 5.2 out of a possible 10.3 
or a percent score of 50.5. 
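The arithmetic for pupil (a) can be retraced in a few lines. The figures are those quoted in the paragraph above; the variable names are illustrative only.

```python
# Weights of the Part I items of Test A that pupil (a) passed, as quoted above.
passed_weights = [0.6, 0.6, 0.9, 0.8, 0.9, 1.4]

raw_score = len(passed_weights)        # one point per item passed
weighted_score = sum(passed_weights)   # sum of the weights of the items passed

raw_possible = 11          # points possible in Part I
weighted_possible = 10.3   # sum of all Part I weights, as given in the text

raw_percent = 100 * raw_score / raw_possible
weighted_percent = 100 * weighted_score / weighted_possible

print(raw_score, round(weighted_score, 1))             # 6 5.2
print(round(raw_percent), round(weighted_percent, 1))  # 55 50.5
```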

Coefficients of correlation were computed by the Product-moment 
Method for the two sets of scores, raw and weighted, the scores being 
handled in discrete series. The results were unexpectedly high as 
shown in Table I. None of these coefficients shows a low enough rela- 
tion to justify the conclusion that the weighting is greatly significant 
in changing the score or the ranking of any one of the individuals. 
The chances are greatly in favor of the simple point score being as 
correct as the weighted score. 
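For readers who wish to repeat the comparison, a minimal sketch of the product-moment computation follows. The two score lists are hypothetical and are not data from this study; the function name `product_moment` is likewise only illustrative.

```python
from math import sqrt


def product_moment(x, y):
    """Pearson product-moment correlation of two equal-length score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    cross = sum(a * b for a, b in zip(dx, dy))
    return cross / sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))


# Hypothetical per cent scores for six pupils (not data from the study).
raw = [55, 64, 73, 45, 82, 91]
weighted = [50.5, 61.0, 74.2, 43.8, 80.1, 92.5]
r = product_moment(raw, weighted)
print(round(r, 3))
```

When the weighted scores preserve the order and approximate spacing of the raw scores, as in this made-up case, the coefficient falls close to 1.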


TABLE I.—COEFFICIENTS OF CORRELATION BETWEEN RAW AND WEIGHTED SCORES
FOR THE FIVE PARTS OF EACH OF TWO TESTS OF COMPREHENSION
OF PASSAGES READ ALOUD TO STUDENTS

            Part I   Part II   Part III   Part IV   Part V   Total
Test A       .991     .996      .987       .997      .993     .995
Test B       .995     .995      .992       .996      .998     .997


When one adds to this fact the difficulty of securing accuracy at 
every involved stage of the computations and the statistical manipula- 
tions requisite to the weighting procedure the reasons for weighting 
the items in this particular test are not easily found. 

It is, however, possible that weighting may not be needed in one 
type of test while greatly needed in another type. The weighting 
process assumes a natural difficulty of certain items for the members of 
the group tested, so that only the very able will be able to pass the most 
difficult at all satisfactorily. But in the tests cited above much 
depended evidently on incidents of interest, attention and experience of 
the subject, so that the poor performer might pass in items which would 
give failure to the better worker and fail in those items which would 
prove very easy for the group as a whole. 

Furthermore the number of items which are taken as the base in
computing the percentage of failure will probably influence the weighting
factor greatly. For example, in the above analysis, if the percentage
of failure had been computed for each of the parts of the tests
independently of any other part instead of grouping all of the points
first, it is evident that the weights would be correspondingly more
greatly differentiated.

Form 8 of the Army Alpha Test was given to the same group of 
pupils and the results for each of the eight test groups were analyzed 
and studied by the same process as before noted. The items of Test I 
(Directions) were scored by points as is customary. The 12 items in 
this test were then weighted, omissions and failures both being counted 
in the percentage of failure. In the Army Alpha Test each test is given 
with a definite time limit in contrast with the test previously noted, in 
which all subjects had all the time needed to give an answer to each 
question after it was orally presented following the reading. 

Since the Alpha Test is an “intelligence” test one might anticipate
that the element of inherent difficulty in certain elements of the tests 
would be more apparent and count for more in the weighting process. 
Especially might this be true since the items are scaled and the more 
difficult would tend to be omitted as well as wrongly answered to a 
greater degree. 

Tests IV and V are those which require alternative responses and
are scored by the Right Minus Wrong Method; consequently they do not
yield to weighting readily, at least by the above process. The question
arises as to how the weightings should be applied so as to make allowance
for guessing. The weightings were found as with other tests, but
when individual scores were determined it was found that the errors
often penalized unduly, and the results were discarded.
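The difficulty can be illustrated with a small sketch. The article does not say how the weights were combined with the Right Minus Wrong Method; the version below assumes that the weights of wrong items are subtracted from the weights of right items. The function name, the response codes, and the weights are all hypothetical.

```python
def right_minus_wrong(responses, weights=None):
    """Score alternative-response items as rights minus wrongs.

    responses: list of "R" (right), "W" (wrong), or "O" (omitted).
    weights:   optional per-item weights; unit weights if not given.
    """
    if weights is None:
        weights = [1.0] * len(responses)
    right = sum(w for r, w in zip(responses, weights) if r == "R")
    wrong = sum(w for r, w in zip(responses, weights) if r == "W")
    return right - wrong


# Hypothetical pupil: passes four easy items and misses the hardest one.
responses = ["R", "R", "R", "R", "W"]
weights = [0.5, 0.5, 0.5, 0.5, 3.0]  # hypothetical weights summing to 5

print(right_minus_wrong(responses))           # unweighted: 3.0
print(right_minus_wrong(responses, weights))  # weighted: -1.0
```

In this made-up case a single miss on a heavily weighted item drives the weighted score below zero, although four of the five items were answered correctly.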

The remaining tests 1, 2, 3, 6, 7 and 8 were analyzed to note the
effect of weighting. The weighted scores were correlated with the
unweighted scores for each test with the results given in Table II. It
is noteworthy that, although these coefficients are indeed high, they are
all lower than any of the coefficients in the case of the first tests
worked with. Tests 3 and 8 are markedly lower than the others.
This would seem to indicate either that the weighting of a smaller number
of test items in a group gives greater significance to the weights or
that certain tests make a greater demand for weighting, possibly due
to the fact that they are more adequate measures of specific or general
ability than others.


TABLE II.—COEFFICIENTS OF CORRELATION BETWEEN RAW AND WEIGHTED
SCORES FOR CERTAIN TESTS OF THE ARMY ALPHA TEST

Test 1   Test 2   Test 3   Test 6   Test 7   Test 8
 .975     .956     .932     .966     .984     .940

With correlations of raw and weighted scores as high as was the case
in Table I it is evident that the intercorrelations of the tests would not 
be greatly varied when the scores were weighted as compared with 
those from unweighted scores. But the Tests in the Army Alpha were 
correlated with each other, first by the use of raw scores, then by the 
use of weighted scores. In Table III the latter coefficients are shown 
in bold faced type. The marked agreement of many of the pairs of 
coefficients is striking as in the correlations of tests 1-6, 1-7, 1-8, 3-6, 
3-8, 6-7. Some of the coefficients are lower, notably 1-2, 2-3, 2-7, 
3-7, and 6-8, but just what this signifies is difficult to say, except that 
the factors affected by weighting are not such as to give the extreme 


TABLE III.—COEFFICIENTS OF CORRELATION BETWEEN VARIOUS TESTS IN THE
ARMY ALPHA TEST WHEN USING RAW SCORES AND WHEN USING
WEIGHTED SCORES

Tests          1      2      3      6      7
2. raw
   weight    .604
3. raw       .330   .518
   weight    .422   .469
6. raw       .507   .58    .50
   weight    .481   .721   .499
7. raw       .587   .731   .673   .639
   weight    .617   .673   .673   .643
8. raw       .542   .291   .485
   weight    .442   .453   65     23     .559











cases any marked advantage, or agreement. The remaining tests 
alone show an increase of coefficient through weighting, and this is 
not a notably large change in any case. In the Army Alpha Test then 
we find greater evidence of the validity of the weighting process in 
point of its influence on the scores, and a clear example of the differen- 
tiation of tests in respect of demand for weighting, but it must be 
remembered that these data cannot determine whether the weighted 
or unweighted scores are the more reliable. In general the findings
here agree with the statement of Yerkes,¹ et al.: “Our original use of
the sum of unweighted scores in all tests as the definition of the combined
scale for the measurement of intelligence is not in error for all
practical purposes in so far as the internal evidence of the system is
relevant.”

A third test was investigated, being applied to the same group of 
pupils, 44 cases. It was planned to use here a test which had had the 
items carefully weighted and scaled so that comparisons might be 
made between the simple unweighted scores, the weighted scores from 
the scaled values and the weighted scores as empirically determined. 

The test used was the collection of 200 analogies as standardized by 
R. Pintner.² These analogies were carefully divided into five tests of 40
analogies each so that the accumulated weighted values of each test 
would be the same and each analogy in each test would be set over 
against others of approximately the same scaled value in the other 
tests. 

The five tests were then applied to the pupils, without a definite 
time limit being used, and the results were scored by three methods; 
by the simple point score, by use of the weighted values assigned by 
Pintner, and by weightings as derived by the same method as in the 
preceding tests, each test group being weighted independently of the 
others. 

Intercorrelations of the five tests were worked out for each of the 
three sets of scores and the results are presented in Table IV. It is 
noteworthy that none of the intercorrelations is greatly altered by the 


TABLE IV.—INTERCORRELATIONS OF SCORES IN FIVE ANALOGY TESTS WITH THE
USE OF THREE DIFFERENT METHODS OF SCORING

              UNWEIGHTED   SCALE OF   EMPIRICAL
TEST PAIRS    POINTS       PINTNER    WEIGHT
1-2            .897         .857       .829
1-3            .863         .846       .861
1-4            .733         .665       .621
1-5            .717         .764       .689
2-3            .755         .773       .746
2-4            .616         .516       .502
2-5            .627         .700       .574
3-4            .759         .687       .672
3-5            .753         .814       .731
4-5            .567         .533       .514


1 Psychological Examining in the United States Army. Memoirs National 
Academy, Vol. XV, No. 3, p. 651. 

2 Journal of Applied Psychology, Vol. IV, Nos. 2 and 3, June-Sept., 1920,
pp. 263-273.

method of scoring. In every case conclusions would be the same
regardless of which set of scores was used. However, the correlations 
with the raw scores are found to be generally higher than those in which 
the weights are derived, showing that the element of difficulty involved 
in any analogy is not so common as to be detected by weighting. 

A typical comparison is here given in Table V of Rankings of the 
Subjects in one of these tests by the three methods of grading. There 
is a marked agreement in the three sets of ranks, in fact the variations 
are largely due to the comparatively large groupings of frequencies in 
the first set as compared with the other two. The extreme cases, 
especially, are in very close agreement. 
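The effect of large groupings on the ranks can be seen in a short sketch. It assumes that tied subjects share the mean of the ranks they occupy (which would account for the fractional ranks in the table) and that rank 1 goes to the highest score. The scores below are hypothetical and the function name is illustrative only.

```python
def average_ranks(scores):
    """Rank scores from highest (rank 1) down; ties share the mean of their ranks."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0.0] * len(scores)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next score ties with the current one.
        while end + 1 < len(order) and scores[order[end + 1]] == scores[order[pos]]:
            end += 1
        shared = (pos + 1 + end + 1) / 2  # mean of ranks pos+1 through end+1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


# Hypothetical scores: the point score has ties, the weighted score has none.
points = [30, 28, 28, 25, 25, 25]
weighted = [31.2, 28.9, 27.5, 25.6, 25.1, 24.8]
print(average_ranks(points))    # [1.0, 2.5, 2.5, 5.0, 5.0, 5.0]
print(average_ranks(weighted))  # [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
```

The two rankings differ only where the coarser point score throws several subjects into one group, which is the kind of variation the paragraph above describes.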


TABLE V.—SHOWING COMPARISON OF RANKINGS OBTAINED BY THREE DIFFERENT
METHODS OF SCORING A TEST


(A) Unweighted; (B) Pintner Scale; (C) Derived weights











| 
| 
Rankings | Rankings 
Subject | |__|} Subject | 
number | | number | 
nia oe | A B C 
| | | | Z 
oats 9 : | | : 
1 20% | 18 | 2 || 2 30144 | 24% | 22 
2 304 | 31 | 30% | 24 44 | 5 | 4% 
3 2 | 2% | 2% | 25 2244 | 27 | 26 
4 48 | 48 | 48 | 26 9 64 7 
5 9 | 9% | 7 | 2 | 22% | 16 | 14 
6 1414 i373) | 6| 13)0~CiW||:Ctl 8 40 404 3914 
7 1414 17 i | 29 2 1 1 
s 144 144 «17 | 30 42 42 42 
9 224% | 27 | 22 | 31 37 38 3744 
10 9 94 | 10% | 32 2 | 2% 216 
11 22146 | 2446 | 284 | 33 3044 | 32 32 
12 41 | 40% | 41 | ~~ 34 37 39 3914 
13 9 94% | 10% | 35 224% 29% | 22 
14 2234 | 22 | 26 | 36 . 9 914 1014 
15 1414 4% | 17 | #=%37 | 9 | 6% 7 
16 9 | 12 | 10% | 388 | 22% 19 17 
17 224% | 20 | 28% | ~~ 39 34 83 3414 
18 22% | 22 | 2 #+| 40 | 37 | 3% ~~ 36 
19 406,44 06¢~C;—C«CA a |] 41g 
20 34 «| (34s 88 | 42 | 304% | 29% 30% 
21 22% | 27 | 26 | 43 #=%| 3&4 =| 3% | 34% 
22 39 | 37 | 37% | 44 | 22% | 22 «| «(17 

It is not to be doubted that weighting has a definite part to play in
test construction and use, but it is quite as certain that this part must 
be more clearly defined than at present. It is possible to spend a great 
deal of time and effort in the computation and use of weighted scores 
with no adequate or comparable return in the improved differentiation 
of pupils in an average group. It is possible that weighting will serve 
its best purpose in assisting in a scaled arrangement of material, even 
though the items be not scored by different values, but this and other 
possibilities must be determined through a more rigid analysis of data 
and the thorough testing of fundamental assumptions involved in the 
process.


INFORMATION AS A MEASURE OF INTELLIGENCE
AND MATURITY 


WILLIAM R. WILSON 
University of Washington 


The difference between tests that set out to measure intelligence 
more or less directly by measuring adjustments to novel situations, and 
those tests that are wholly or in part tests of maturity or experience 
has been recognized for 20 years. It has been pointed out many times 
that tests designed to discover differences in intelligence are valid 
measures only when the individuals composing the group examined are 
homogeneous as to past training and familiarity with the materials 
that compose the test. Such tests as color naming, giving dates, defi- 
nitions, vocabulary, spelling, giving the value of stamps and coins, and 
general information, are regarded with some suspicion as likely to be 
unduly influenced by mere length of residence in the world. When 
these suspected tests are given to children of different ages there is 
found a fairly uniform increase in ability to handle them adequately 
as age increases. What part of this increase in ability is due to growth 
of intelligence with maturation and what part is due simply to further 
experience is not known. 

When subjects of the same age and experience are to be compared 
with respect to general mental ability, the inclusion of the information 
type of test is scarcely open to criticism, as mental maturity and 
information show characteristic concomitant variation from year to 
year. However, little of our group testing in schools is of this kind. 
In the larger part of mental testing in school the individuals compared 
are usually homogeneous as to school grade rather than as to age or 
experience, and are often not even of the same school grade. For 
example, a test of the general type of the Army Alpha is given to all 
the pupils of a four-year high school, and the resulting scores are used 
as a basis for the comparison of the pupils as to general ability or what- 
ever function it is assumed that the test measures. These pupils range 
in age from 12 to 20 years, from childhood nearly to complete mental 
and physical maturity. The range of educational achievement is four 
years. In spite of these facts, on the average one-fourth of the ques- 
tions attempted in the Alpha test are in the general information section 
(Test 8) where ability to score may be largely contingent upon 
maturity rather than upon native ability. In comparing the Alpha test 
grade of a child of 12 with a man of 20, what allowance should be made 
for this fact? Or, more generally, if the Alpha test is given in a high
school what correction, if any, should be made for the predominant
weight given to the information test?

In the study of the problems thus suggested the test papers of 168 
pupils of Grades IX to XII in the Grand Rapids (Minnesota) High 
School have been used. The tests (Alpha, Forms 6 and 7) were given 
in September, 1919. For purposes of comparison of two groups of 
different experience and maturity a test paper of the same total score 
as each high school paper was selected from the Alpha test blanks 
(Form 6) given to students in the University of Washington in March, 
1919. The results of this comparison are shown in Table I. 


TABLE I
                                        HIGH SCHOOL     UNIVERSITY

Number of cases                          168             168
Mean total score                         105.6           105
Mean Test 8                              20.1            21.7
Mean age                                 15.7            21.5
Standard deviations
  of total score                         26.0            26.0
  of Test 8                              9.9             7.2
  of age                                 1.6             5.1
Coefficients of correlation
  of total score and Test 8              .69 ± .027      .66 ± .029
  of total score and age                 .11 ± .05       -.16 ± .05
  of age and Test 8                      [illegible]     -.06 ± .05
Per cent Test 8 is of total              19.0            20.5


These results show that with the groups used there was no undue 
influence of age or experience upon score in the information test. In 
both groups the correlation of age with total score and with information 
is, in effect, zero. In freedom from the influence of ages as such the 
information test suffers nothing as compared with the seven other tests 
of Alpha. 

In the interpretation of these results it must be borne in mind that 
the university students whose papers were selected for pairing with the 
high school pupils were obviously an inferior group. Seventy-four 
per cent of the total university body exceeded the mean score of this 
group of 168 students. This gives a comparison between normal high 
school students and somewhat dull university students. The varia- 
bilities of the two groups are sufficiently close, except in the case of age, 
to make possible a direct comparison of coefficients of correlation. As
all the correlations involving age are, in effect, zero, the larger age
variability of the college group may be ignored. No coefficients of partial
correlation have been worked out for these data because of the absence
of correlation between age and the other two variables.

In Table II are shown the scores made by pupils of different ages.
By computing the ratio of Test 8 to total score, and using year 14 as
the norm, the correction that must be added to total score in order to
have the information test contribute the same relative amount for
each age has been computed. As the table shows, the general tendency
is for the information test to contribute a relatively large amount to
the scores of the younger pupils, so that, while the difference is very
slight, they appear to be somewhat favored by the information
questions.








TABLE II

Age      Number     Mean total   Mean     Ratio    Correction
         of cases   score        Test 8

11-12       4         86.75      17.2     .1988      -1.1
13          7        105.28      21.7     .2062      -1.8
14         32        100.06      18.9     .1892       0
15         32        104.03      20.3     .1952      -0.7
16         33        111.51      20.0     .1801       1.0
17         34        107.08      19.6     .1837       0.6
18         18        108.88      21.1     .1998      -0.6
19          4        102.75      16.7     .1630       2.7





A negative correction indicates the amount by which total score is 
disproportionately large owing to the group doing relatively better 
than the 14-year group in Test 8. 
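The article does not state the formula by which the correction was computed. One reading, offered here only as an assumption, is that the correction equals the difference between the norm ratio and the age group's ratio, multiplied by that group's mean total score. With the ratios as printed in Table II this reproduces the corrections for ages 13, 16, 17 and 19 to one decimal place; other rows differ slightly, so the author's actual procedure may have been different. The function and constant names are hypothetical.

```python
def correction(ratio_age, ratio_norm, mean_total):
    """Points to add to a group's total so Test 8 forms the norm share of it.

    Assumed formula: (norm ratio - group ratio) * mean total score.
    """
    return (ratio_norm - ratio_age) * mean_total


NORM_RATIO = 0.1892  # ratio printed for the 14-year group in Table II

# (age, mean total score, printed ratio) for four rows of Table II
rows = [
    (13, 105.28, 0.2062),
    (16, 111.51, 0.1801),
    (17, 107.08, 0.1837),
    (19, 102.75, 0.1630),
]
for age, total, ratio in rows:
    print(age, round(correction(ratio, NORM_RATIO, total), 1))
```

A negative value means the group's total is reduced, since its information score is relatively larger than that of the norm group.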

To discover the effect of maturity on Test 8, Alpha, within a uni- 
versity group a selection of 736 papers was made from the same testing 
as in the preceding study. The ratio of total score and information 
score was computed as before, and also corrections prepared as before, 
using the 18-year group as the norm. The correction is the amount 
that must be added algebraically to the total score of its corresponding 
age to eliminate the influence of age upon information. It will be 
noticed from Table III that the correction for the age between 17 and 
27 is less than 1 point of total score. For the oldest group, containing 
ages between 28 and 54, with the mean age 33, the correction is possibly 
large enough to be of significance.

TABLE III

Age      Number     Mean total   Mean     Ratio    Correction
         of cases   score        Test 8

17          64        141.1      28.3     .2007       .43
18         111        132.8      27.0     .2043       0
19         111        136.0      28.5     .2099      -0.92
20         111        135.7      28.1     .2069      -0.30
21          11        129.3      27.2     .2109      -0.94
22         111        134.9      28.3     .2098      -0.91
23         111        182.6      27.4     .2045      -0.32
24          11        134.5      28.1     .2089      -0.90
25-27       63        136.6      28.8     .2106      -0.94
28-54       54        129.6      29.6     .2260      -1.96


To discover whether information is influenced by school grade in
any unusual way a random selection from the college testing previously
used was made. The results, which are tabulated in Table IV,
show that in the case of each class the relative score contributed by
Test 8 is approximately the same as for each other class.


TABLE IV
                                   FRESHMEN   SOPHOMORES   JUNIORS   SENIORS
Number of cases                      193         165         163       120
Age                                  19.5        20.6        21.7      23.3
Median total score                   131.1       138.4       135.7     144.0
Test 8 score                         28.8        30.0        29.8      30.1
Per cent Test 8 is of total score    21.1        21.6        21.9      20.9


Insofar as these high school and college test papers are typical, an 
analysis of the scores indicates that the information test is subject to 
the same influences and to the same degree as the other tests in Alpha. 
Neither chronological age nor school grade appears to influence unduly 
the score in the general information test except for a possible slight 
advantage of a group of unusually old and rather slow college students. 
With this exception, when the Alpha grades of students of different 
ages and school grades are compared the correction for the influence of 
age and experience is so small as to be negligible. 

The high correlation of information tests with other measures of
intelligence has made them valuable members of many batteries of
tests. The ease with which an information test can be adjusted to a
large number of needs by “trial and error” selection from a large number
of questions makes it adaptable to many varied conditions. It is
highly fortunate that the suspicion of these tests, based on common
sense grounds, is unfounded in the case of group tests given to
high-school and college students.











AN EXTENSIVE EXPERIMENT IN MOTOR 
LEARNING AND RE-LEARNING 


S. R. BRADEN 


University of Missouri 


On April 25, 1921, an experiment in ball-tossing was begun and con- 
tinued for 100 days. A box which stood 8 inches high with a 5-inch 
circular hole in the top served as the target. Balls of solid rubber 
154 of an inch in diameter were used. A cloth pad was placed in 
the bottom of the box to prevent the balls from bouncing out. The 
distance between the throwing line and the box was 12 feet. The ex- 
periment consisted simply in throwing the balls into the box. The 
ultimate object of the experiment was to study the nature of the learn- 
ing capacity in trial and error learning when the learning is spread over 
a considerable period of time, and especially to note the course of the 
re-learning curve. 

Twenty-five balls were used. When these had been thrown they 
were gathered and thrown again until 200 balls had been thrown. This 
constituted one day’s practice. The score indicated only the balls 
which fell into the box. The experiment was begun on April 25, 1921 
and ended August 25, 1921. Nine o’clock in the morning was the hour 
for daily practice. Sunday was taken as a rest day. The experiment 
lasted for 100 days, so that 20,000 balls were tossed. The scores were: 
37, 25, 42, 45, 53, 51, 56, 55, 56, 60, 54, 65, 64, 65, 68, 69, 75, 79, 64, 
70, 68, 67, 74, 68, 74, 76, 76, 76, 80, 79, 78, 81, 79, 82, 84, 85, 78, 84, 
89, 85, 82, 88, 88, 87, 90, 88, 86, 86, 86, 89, 89, 90, 93, 92, 92, 90, 88, 
88, 94, 96, 90, 91, 89, 92, 88, 88, 87, 90, 88, 86, 88, 87, 91, 89, 91, 89, 
91, 91, 88, 88, 90, 89, 95, 93, 97, 98, 102, 98, 99, 102, 102, 102, 98, 91, 
94, 89, 94, 94, 91, 95. 

On March 5, 1923, 22 months and 11 days after the first experi- 
ment ended, a re-trial was carried on for 18 days under the same condi- 
tions as was the original experiment. The scores were: 40, 52, 56, 
58, 60, 70, 81, 85, 90, 102, 104, 105, 102, 102, 103, 103, 102, 101. 

Again on October 15, 1923, 6 months and 20 days later, a second 
re-trial was carried on for 18 days. The scores were: 60, 70, 74, 70, 
86, 84, 89, 92, 92, 96, 102, 107, 109, 107, 110, 110, 105, 105. 

The graph indicates clearly what the results were. The number of 
balls accurately thrown in the first 10 practices of the first experiment 
was 490; in the first re-trial it was 694; in the second re-trial it was 813. 
This gives an idea of the rapid rise of the curve in the re-trials. If we
reckon the total number of balls thrown accurately in the first 18 practices
of the first experiment we have 1029. The total for the first re-trial is 1516;
and for the second re-trial is 1668. It is evident, therefore, that
improvement was rapid in the re-trials; and that the second re-trial
shows a marked improvement over the first re-trial.
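The tallies for the two re-trials can be checked directly from the daily scores listed above. The sketch below uses only the two re-trial series as printed; the variable names are illustrative, and the best-day rate is an added convenience based on the 200 balls thrown per practice.

```python
first_retrial = [40, 52, 56, 58, 60, 70, 81, 85, 90, 102,
                 104, 105, 102, 102, 103, 103, 102, 101]
second_retrial = [60, 70, 74, 70, 86, 84, 89, 92, 92, 96,
                  102, 107, 109, 107, 110, 110, 105, 105]

BALLS_PER_DAY = 200  # balls thrown in one day's practice

for name, scores in [("first re-trial", first_retrial),
                     ("second re-trial", second_retrial)]:
    first_ten = sum(scores[:10])      # hits in the first 10 practices
    all_eighteen = sum(scores)        # hits in all 18 practices
    best_rate = 100 * max(scores) / BALLS_PER_DAY  # best day, per cent of throws
    print(name, first_ten, all_eighteen, best_rate)
# first re-trial 694 1516 52.5
# second re-trial 813 1668 55.0
```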

Many interesting questions arise. Can one gain much more than
50 per cent control in such a performance? Why, after one has thrown
six balls successfully, does he miss the seventh? Is it better to use a
simple arm movement, or to bring the entire body into a rhythmic
sway before the ball is released? Could one teach others to toss the
balls accurately; or must each learner, by the trial and error method,
discover his own “best way”?

Graph 1 shows the daily record for 100 days’ ball-tossing.
Graph 2 shows the daily record for 18 days’ ball-tossing 22 months after the first
experiment was finished.
Graph 3 shows the daily record for 18 days’ ball-tossing 6 months and 20 days
after the second experiment was finished.
The practices are represented on the horizontal axis and the scores on the vertical
axis.

The experiment has to do with the vital elements of character. A 
high degree of concentration is required and it is easy to “let down.”

Perhaps more fruitful results would be obtained if all good throws
counted according to a graduated scheme, as in target-shooting. This 
would tend to do away with discouragement, especially if the experi- 
ment were used for children. For the strictest of accuracy, only one 
ball should be used, which could be returned to the thrower by means 
of a trough. If balls of absolute identity in weight and size can be 
secured, of course they would be preferable to the one ball with the 
trough. The experiment can be made as difficult or as easy as is 
desired by changing the throwing distance. Since only inexpensive 
apparatus is required, it is hoped that many others will perform this 
experiment. It can be used either in home or school and throws much 
light upon improvement and re-learning.


INTELLIGENCE TESTS


13.77 versus 12.05: A Study in Probable Error. Maurice B. Hexter and Abraham
Myerson. Mental Hygiene, 1924, January, 69-82. A critical discussion
of Brigham’s “A Study of American Intelligence” from the point of view of race
discrimination and propaganda.

The Army Tests and the Pro-Nordic Propaganda. Wm. C. Bagley. Educational
Review, 1924, April, 179-187. Challenges Brigham’s interpretations
and assumptions based on army test data in his book “A Study of American
Intelligence.” Proposes in contrast, “the constructive program of the rational
equalitarian.”

The History of Mental Testing. Kimball Young. The Pedagogical Semi- 
nary, 1924, March, 1-48. Summarizes the history of psychology in Germany, 
France, England, and America with brief mention of the names, dates, and work 
of the men whose contributions are outstanding. 

The Mental Test as a Psychological Method. Lewis M. Terman. Psychological 
Review, 1924, March, 93-118. Discusses the value of the mental test in relation 
to the whole science of psychology. 

A Psychological Description of Intelligence. Raymond H. Wheeler. Psychological
Review, 1924, March, 161-174. Defines intelligence as “derived covert
behavior” on four levels of complexity: localization, recognition, comprehension,
and interpretation.

Intelligence Tests versus Entrance Examinations as a Means of Predicting 
Success in College. Ada Hart Arlitt and Margaret Hall. The Journal of Applied 
Psychology, 1923, December, 330-338. A three years’ study of scores on entrance 
examinations, college grades, and scores on intelligence tests of 305 college students 
shows entrance grades to be the best measure on which to predict success in 
college. 


EDUCATIONAL TESTS 


The New Type Examinations in Physics. H.W. Farwell. School and Society, 
1924, March 15, 315-322. Illustrates the use of true-false, completion, and 
multiple choice tests in physics as satisfactory supplements to the old type of 
examination. 

The Relation between Reading Ability as Measured by Certain Standard Tests 
and the Ability Required in the Interpretation of Printed Matter Involving Reason. 
B. T. Ritter and W. T. Lofland. The Elementary School Journal, 1924, March,
529-546. Statistical data support the contention that there are different types
of reading ability requiring specialized types of training.

The Reliability of Measurements in Spelling. R.L. Morton. The Journal of 
Education Method, 1924, April, 321-328. Questions the grade standards and 
the assigned difficulties of words in the Ayres Scale. Data secured from 2863 
children in cities in Ohio. 

Fundamental Issues in Evaluating Composition. Joseph C. Gainsburg. The 
Pedagogical Seminary, 1924, March, 55-77. Points out the inadequacies and 
errors in composition scales and suggests a list of criteria for judging a composition. 

The Reliability of Tests Requiring Alternative Responses. Wm. Asker. Journal 
of Educational Research, 1924, March, 234-240. The present method of scoring— 
rights minus wrongs—does not eliminate the effect of guessing in individual cases. 
Guessing should be discouraged. 


CASE STUDIES


Reginald. Ross D. Murphy. The Psychological Clinic, 1924, January, 154-156.
Reports the study of a twelve-year-old boy who is “nervously a wrecked
machine which refuses to function.”

James, A Study of a Post-encephalitic Child. Rose Wimenitz. The Psycho- 
logical Clinic, 1924, January, 144-148. Details of the study with the prognosis 
a question. 

The Spotless Child. Beatrice Green. The Psychological Clinic, 1924, January, 
149-153. The story of Marie, a middle grade imbecile whose mental condition 
is explained in part by a long series of illnesses very early in her life. 

What Are Little Boys Made of? Ellen M. Jordan. The Psychological Clinic, 
1924, January, 138-143. A case study of reading disability. 

The Superior Child. Alice M. Jones. The Psychological Clinic, 1924, Jan- 
uary, 130-137. Reports four case studies of children who have general intellectual 
superiority and marked musical ability. 

A Case of Secondary Mental Deficiency with Musical Talent. Blanche M. 
Minogue. The Journal of Applied Psychology, 1923, December, 349-352. 
Describes a male patient at Letchworth Village—chronological age 23, mental 
age about 7 years—who is a pianist of unusual ability. 

The Problem Child. Phyllis Blanchard and Richard H. Paynter, Jr. Mental 
Hygiene, 1924, January, 26-54. Reports the results of a study of 500 school 
children considered problems at home or at school. Medical, psychiatric, and 
psychological examinations were given and a social history of each child obtained. 
Seven case studies are reported in detail. 

Problem of the Quasi-delinquent in the School. Dorothy Wallace. Mental 
Hygiene, 1924, January, 115-165. What schools can do and are doing to prevent 
delinquency. Illustrated by case studies. 


NEW TESTS


Rhode Island Intelligence Test. G. E. Bird and C. E. Craig. Group test for 
children from three to six years of age. Public School Publishing Company. 
Bloomington, Illinois. 

New Standardized Oral Reading Check Tests. William S. Gray. Supplied 
in pairs of 100 tests for testing 20 individuals five times. Set I, Grades I and II.
Set II, Grades II-IV. Set III, Grades IV-V. Set IV, Grades VI-VIII. Price
of each set including direction sheets and record sheets $1.50. Public School
Publishing Company, Bloomington, Illinois.

Stevenson Problem Analysis. P. R. Stevenson. For training in analysis of 
word problems in arithmetic. Tentative norms. Public School Publishing 
Company, Bloomington, Illinois. 

Pressey Test in Diagnostic Composition. S. L. Pressey, Mrs. E. U. Bowers, 
Helen Ruhlen and F. R. Conkling. Separate tests for Capitalization, Punctuation, 
Grammar, and Sentence structure. Public School Publishing Company, Bloom- 
ington, Illinois. 

Iowa Physics Test. Dr. Harold L. Camp. Heat, Mechanics, Electricity, and 
Magnetism. Two forms for each test. Tentative norms. Public School Publish-
ing Company, Bloomington, Illinois.

Chemistry Test Epsilon. S. G. Rich. Second Form. Twenty-six items with
multiple choice answer. Public School Publishing Company, Bloomington,
Illinois.


MISCELLANEOUS 


The Relation of Mental to Physical Growth. William Filler Lutz. The Psy- 
chological Clinic, 1924, January, 125-129. Reports the results of mental and 
physical examinations of 100 boys. 

The Conservation of Talent. Lewis M. Terman. School and Society, 1924, 
March 29, 359-364. A summary of the results to date of an investigation of 
1000 superior children. Objective data along anthropometric, medical, educa- 
tional, social and psychological lines are being collected and the histories of the 
children are to be followed into adult life as far as possible. 

Diagnostic and Cumulative Surveys of School Achievement in Minnesota. M. J. 
Van Wagenen. 1924, March, 147-168. Points out and illustrates significant 
features of school surveys which are of importance to grade principals and school 
superintendents. 

Sex Differences and Sex Variability in Learning Capacity. W. H. Pyle. School
and Society, 1924, March 22, 352. Studies of boys’ and girls’ learning capacity 
in card sorting and digit-letter substitution show the girls’ scores to be consistently 
higher. Learning of this type improves with age. 

Selection in “Trial and Error” Learning. William R. Wilson. Psychological
Review, 1924, March, 150-160. Explains the origin and principles of selection 
of movements in trial and error learning in terms of familiar processes. 

The Franz Dot Tapping Test as a Measure of Attention. Irene Case Sherman. 
The Journal of Applied Psychology, 1923, December, 353-359. Discusses the 
use of the Franz Dot Tapping Test with 172 unselected cases. Results are com- 
pared with chronological age, mental age, and IQ. Six tables present statistical 
data. 

Organizing Classes for Gifted Children. Herbert D. Bixby. Educational 
Review, 1924, April, 196-199. Discusses the selection of gifted children for 
special classes, the importance of a superior type of teacher, and the ways in 
which the public can aid the work. 

Prevention of the Misfits. Margaret S. McGill. The Pedagogical Seminary, 
1924, March, 49-54. Emphasizes the need of practical educational advantages 
and competent vocational guidance for foreigners. 

A Study of Kraepelin’s Continuous Subtraction Test. The Journal of Abnormal
Psychology and Social Psychology, 1924, January-March, 385-388. Trials with
113 unselected children show this test to be applicable as an attention test only
for the normal adult.

Two Hundred Cases of Dementia Praecox Tested by the Stanford Revision.
Mary M. Wentworth. The Journal of Abnormal Psychology and Social Psy-
chology, 1924, January-March, 378-384. Results of tests with dementia praecox
patients indicate volitional and affective disorders rather than intellectual defects 
and emphasize the importance of individual study and treatment at an early 
stage of the disease. 

The Influence of Latin on the Spelling of English Words. Warren W. Coxe. 
Journal of Educational Research, 1924, March, 223-233. An experiment con- 
ducted with five groups of classes each following a different procedure shows 
that the study of Latin has an appreciable effect on English spelling, especially 
when rules governing the spelling of Latin derivatives are developed. 

Reducing the Variability in Teachers’ Marks. E. J. Ashbaugh. Journal of 
Educational Research, 1924, March, 185-198. Reports an experiment designed 
to show student teachers the great variability in teachers’ grading. Suggests 
clear-cut definition of what is being marked and agreement as to the relative 
value to be assigned to the various elements as a means of improvement.

The Memory Value of Charts in College Lecture Courses. Donald A. Laird. 
School and Society, 1924, March 29, 381-382. The use of charts with a lecture, 
unless directly referred to and elaborated upon, is not effective. 

Some Whys of Whims—A Genetic Study. Garry Cleveland Myers. The 
Pedagogical Seminary, 1924, March, 78-83. Notes on the behavior of a child 
showing that whims are apparently just habits. 


EDUCATIONAL PSYCHOLOGY SIMPLIFIED


The Child’s Mind and the Common Branches, by Daniel Wolford 
La Rue. New York: Macmillan, 1924. Pp. X + 483. 


“Practical and positively helpful, is the motto I have placed before
myself in writing this book,” announces Dr. La Rue in the opening 
sentence of his introduction. For once a motto has been effective, 
and the result is an elementary textbook in educational psychology 
that teachers in training at normal schools should be able to read with 
comprehension and later to consult with hopefulness.

Part I (118 pages) sketches the physiological basis of mental 
activity, develops the concept of bonds, and discusses the three laws 
of Readiness, Exercise and Effect as formulated by Professor Thorn-
dike. The remainder of the book makes application of this matter
to the teaching of the common branches. Inquiry is made as to the 
desirable bonds to be formed in each subject, the proper order of their 
formation, and the practical significance of the laws of learning in 
determining method. The ideas presented are, for the most part, in 
accord with the findings of modern research, and are traceable to their 
sources in excellent reference lists at the ends of chapters. In the case 
of reading alone, where oral training is stressed, is there any marked 
difference in emphasis from that of recent expert treatments. 

An engagingly personal style and diction, together with an abun- 
dance of homely, pointed illustration, is calculated to hold the reader’s 
interest and quicken his understanding. Many supplementary 
exercises are suggested to illuminate and fix more securely the princi- 
ples enunciated in the text. There are, of course, obvious limitations 
to the value of such a book for the mature student of education; but 
for the beginner, the young high school graduate just entering the 
normal school, its spirit and method seem especially well conceived. 

M. H. WILLING. 


A TEACHING HELP FOR EDUCATIONAL PSYCHOLOGY


Syllabus for a First Course in Educational Psychology, by F. B. Knight
and G. M. Ruch. Iowa City, Iowa: The Iowa Supply Co.,
1924. 


Teachers of educational psychology who use Gates’ “Psychology
for Students of Education” will welcome a recent syllabus designed to
accompany this text. The authors have divided the material of the 
text into 30 lessons. For each lesson the assignment has been given in 
detail accompanied by notes explaining or calling attention to difficult 
points. Detailed discussions of the material for which the student 
should be held accountable are included in each lesson. The notes on 
the lesson show evidence of much thought and a knowledge of the 
difficulties with which students in beginning educational psychology 
meet. The points upon which the pupils are expected to recite are 
well selected. 

The only apparent weakness in the syllabus is the lack of references 
to supplementary reading material. 

CARL L. HUFFAKER.
University of Arizona. 





THE EFFECT OF ATTITUDE ON WOMEN’S PHYSICAL FREEDOM


Woman’s Physical Freedom, by Clelia D. Mosher. New York: The 
Woman’s Press, 1923. Pp. 87. 


In its refutation of the fallacious psychology of sex limitations 
which has been disseminated among women by tradition and precept, 
Dr. Mosher’s admirably sane and scientifically sound treatment of 
women’s disabilities should prove a valuable contribution to the 
psychology of individual differences. 

The book is written from the medical standpoint and Dr. Mosher 
bases her conclusions on the study of more than 2000 university 
women, her observations having extended over a period of 30 years. 
Her thesis is, in brief, that the physical freedom of the woman of the 
present college generation “with her perfectly working body and superb
health” is a direct outcome of “an alteration of the morbid attitude”
of women themselves toward factors which produce disabilities. 

The assumption that certain sex differences are due to original 
nature has influenced largely our educational theories and practices. 
The author has shown that even these physical traits which have been
thought to cause differences in behavior, achievement and mental 
activities, are themselves largely determined (functionally) by ideas. 
Attitudes imposed by tradition and fostered by training account for 
some differences which have been considered fundamental. 

Emancipation from the deleterious effects of regarding functional 
periodicity in the light of a physical disability—as though digestive 
disturbances were to be regarded as normal and inevitable—the 
increase in height (the average height of college women today is 1.2 
inches above the average height 30 years ago) and size of waist, resulting 
from freedom from earlier constrictions of dress and the greater 
physical vigor, together with the increased muscular activity of the 
modern athletic woman constitute important factors in woman’s 
physical freedom which in a “world of unlimited opportunities” will
enable “the woman of the rising generation to answer the question of
what woman’s real capacities are.”

It is significant that Dr. Mosher, in pointing out that it rests with 
the woman alone whether she rejects this new physical freedom, 
clinging to the old ideal of weakness and dependence, places first 
“the opportunities of making herself a better citizen,” that is, of using 
“the opportunities for her self-development—interesting and worth- 
while living, service to the race, economic and civic—in other words, 
to have a life crowded full of things worth having.”

The book is specially commended to students and teachers of edu- 
cational psychology. It presents a point of view fundamentally 
sound and significant in the investigation of the direct share of sex 
in the production of mental differences. 

MAUD A. MERRILL.


TYPICAL LABORATORY EXPERIMENTS IN THE PSYCHOLOGY OF LEARNING 


A Laboratory Manual in the Psychology of Learning, by William Henry 
Pyle. Baltimore: Warwick and York, 1923. Pp. 161. 


“This manual is published in the hope of furthering and facilitating
experiments in educational psychology and is the outcome of 14 years 
of experimentation.” 

In the first chapter the nature, aims and method of experimentation 
are presented in a clear, simple and forceful way which no one seems to 
have excelled. The first few pages are particularly good. 

Thirteen pages are given to “Mathematical Treatment of Data”
covering briefly the learning curve, frequency surface, measures of 
central tendency, measures of variability, and correlations, with a 
small amount of illustrative materials. In the next 120 pages the 
author lists experiments in a rather elaborate and effective manner, 
stating the object, materials, method and the results. Ball-tossing 
and mirror-drawing are given as trial and error types of motor learning 
followed by a card sorting and a marble sorting experiment. Then
comes the familiar digit-symbol experiment to illustrate semi-motor
learning. Next the card sorting experiment is adapted to measure
“retention of motor learning” and to “study inhibition.”

Following these are experiments in which the tachistoscope is used. 
The spot pattern test is used to measure perception span. Serial 
learning is studied with nonsense syllables and meaningful words.
In an experiment on associative learning, paired associates are used.
Visual and auditory methods of rote and ideational learning are
compared. 

If the purpose is to master some technique, manipulate data and 
laboratory apparatus apart from any definite bearing upon everyday 
problems of life and learning this is a pretty good manual. 

It seems a pity, however, that the author does so little to prevent the
innocent student from thinking that all learning curves are of the con- 
cave type or that Professor William Henry Pyle has had to struggle 
alone for these 14 years to create out of nothing this group of rather 
ingenious experiments. Nevertheless, for models of simple statement 
of technique, any one who desires to write up a series of laboratory 
experiments in psychology will do well to turn to this little manual by 
Professor Pyle. 

GARRY CLEVELAND MYERS.

Cleveland School of Education. 


YERKES-BRIDGES POINT SCALE REVISED


1923 Revision: A Point Scale for Measuring Mental Ability by Robert 
M. Yerkes and Josephine C. Foster. Baltimore: Warwick and 
York, 1923. Pp. VII + 219. 


The original edition of this book described in a series of papers the 
point scale method, the use of the original Pre-Adolescent Scale, and 
the revision based on an analysis of the results. In the final chapter 
of that volume the authors proposed to develop a universal scale based 
on three perfected limited-range scales—the Pre-Adolescent, the
Adolescent-Adult, and the Infant or Child Scale. The present volume 
presents these three scales. The Pre-Adolescent scale is essentially 
as in the first edition and the two supplementary scales are submitted 
merely in tentative form. 

The material has been entirely reorganized and rewritten. The 
chapters of the first edition which dealt with the history of the method, 
the initial form of the scale, and the results upon which the Pre-Adoles- 
cent Scale was based are omitted. A brief description of the Point 
Scale Method and a comparison with the Binet Scales is followed by 
directions for giving the Pre-Adolescent Scale. In this chapter the 
rewriting, which follows closely Terman’s style in “The Measurement
of Intelligence,” has been particularly effective. The standardized
directions stand out clearly in italics and additional directions make
clear to the examiner exactly what is to be done. Many illustrative
responses have been added and these, together with the detailed
“remarks” on practically every test, aid the examiner materially in
the interpretation of a performance. The results of the original scale
are summarized in one chapter and norms based on these results are 
recommended for use as standards of comparison. They are repeated 
at the end of the book for ready reference, together with the printed 
materials necessary in the giving of the test. The Adolescent-Adult 
and the Infant Scales are described and full directions for their use are 
given. Examiners who feel the need of more adequate measures of 
the years above twelve or thirteen or below seven will be interested in 
using these scales critically and in aiding in their revision. 

An especially interesting part of the book is the section given to a 
discussion of “The Binet versus the Point Scale Method of Measuring
Intelligence.” A careful analysis of the two scales shows how radically
they differ on fundamentally important points, and points out the
advantages in each scale which might help to make a “new scale
infinitely superior to anything Binet or his successors have imagined.”

Crecit M. Co.uoron. 





A TIMELY SURVEY OF THE FIELD OF HEARING 


Hearing by Robert M. Ogden. New York: Harcourt, Brace and Co., 
1924. Pp. XIII + 351. 


This volume has been many years in the making, and is the work of 
an authority whose intimate association with the work in this field, as
editor in charge of the department of the Psychological Bulletin dealing 
with the sense of hearing, eminently qualifies him for such an under- 
taking. 

It has the unique distinction of being the only work of its kind in 
any language, representing a systematic interpretation and organiza- 
tion of the increasingly large number of experimental studies in this 
field, as well as that of being a lucid and trustworthy account of its 
many ramifications. The book comprises a discussion of the physics 
and physiology of hearing, an analysis of tone, vocal sound and noise, 
and the parts they play in music and language, localization of sound, 
pathology of hearing and aural education. Notwithstanding the 
range of treatment, which ordinarily would lead to superficiality and 
arbitrariness, one is favorably impressed with the author’s attempt to 
marshall the significant facts and to maintain an attitude of open- 
mindedness on all debatable issues. The extensive list of sources, 
utilized and specifically referred to, is an outstanding feature. 

One must appreciate the fact that the author has kept the varied 
interests of a wide range of readers constantly in mind. Consequently, 
this contribution should prove an indispensable aid, not only to the 
psychologist and otologist, but to the teacher and student of music
and phonetics in the solution of his special problems as well. 

J. KUDERNA. 


CURRICULUM-MAKING FOR CHINA 


Education for Modernization in China, by Peng Chun Chang. New 
York: Teachers College, Columbia University, 1923. Pp. 92. 


The task to which the author of this book addresses himself is the 
formulation, through analysis and constructive types of reasoning, 
of a plan for curriculum revision in China’s middle schools. China 
has great need of leaders in her effort to become modern. The present 
tendency toward undiscriminating imitation of the forms and materials 
of Western secondary education is not likely to meet this need success-
fully. To be safe, it is necessary first to answer such questions as these:
What are the significant characteristics of this modernism that China 
so much desires? What influences produced them? How can these 
influences, probably adventitious in the West, be consciously generated 
in Chinese schools so as to develop in prospective leaders the essential 
qualities of the modern spirit? 

The author goes to the foremost analysts of the day for an interpre-
tation of modernism. He concludes from this research that scientific
method, individuality, and democracy (the three chief characteris- 
tics of Western modernism) have been the direct outgrowths of the 
“exploring, pioneering, frontier community-building, large-scale pro- 
ducing, and the like” incident to Expansion in the last four or five 
centuries. On the basis of this hypothesis he next sets up criteria for 
the selection of curriculum materials (activities) that will serve in a 
Chinese environment as substitutes for the influences identified in the 
expanding life of the Western nations, particularly of America. 
Finally he reaches the point of offering suggestive outlines of projects, 
formulated subject matter, and types of social organization among 
pupils. These outlines are purely tentative and intended only for the 
initial direction of experimentation. 

The study as a whole is a remarkably fine illustration of con- 
structive thinking, very clear in its progressions and not too far 
removed from practical needs and possibilities. It would be hard to 
find a more thorough or consistent application of the educational philos- 
ophy of Professor Dewey and the Columbia group to the exigent 
problem of curriculum revision not only in China but in America as 
well. 

M. H. WILLING. 


A HANDBOOK ON READING METHODS 


How to Teach Reading, by Mary E. Pennell and Alice M. Cusack. 
New York: Houghton Mifflin, 1923. 


Based on the theory that, for ordinary purposes, a teacher needs to 
know the results and not the processes of psychological research, Pen- 
nell and Cusack’s new book on reading bridges the gap between the 
laboratory and the classroom. The conclusions of the psychologists 
are stated briefly and without technical language. The results of 
investigations concerning the work of the eye in reading are admirably 
summed up. The authors consider the laws of learning only as they 
apply to reading. Throughout, the book is based upon sound psycho- 
logical principles, which it accepts as a major premise and from which it 
proceeds deductively. How to Teach Reading is distinctly the product
of persons who understand the problems of the classroom. A chapter 
on “Reading Stimuli” makes specific and usable suggestions about
greetings, illustrations, puzzles, chart stories, etc. Various methods of 
keeping grade records, class and individual graphs are illustrated by 
simple diagrams which any energetic teacher could use with a minimum 
of effort. 

In organization, the book is admirably suited to reference needs. 
As far as possible, outlines and summaries are used, so spaced that the 
eye quickly locates what is wanted. The waste of repetition is 
avoided by careful cross references which make all related material 
easily available. 

The writers have great confidence in the power of well framed 
“motive” questions to bring about a state of readiness for reading.
They also emphasize the functions of questions which demand judg- 
ment and organization. The use of these three types of questions in 
carefully selected reading units for the elementary grades is well 
illustrated throughout the book. 

One of the great contributions of recent studies in the psychology 
of reading has been to show that reading is a complex of many specific 
abilities. General training or specific training along one line will not 
result in adequate control of all of these abilities but will facilitate in 
some cases and inhibit in others. This fact implies that careful 
analyses must be made (1) of the specific abilities involved, (2) of
the ways and means by which they develop, and (3) of the peculiar reac-
tions which result from over or under training. From this standpoint
the material on diagnosis and remedial work is inadequate except for 
very elementary work. The section on the between-recitation period
will also be disappointing to many because it contains little new 
material and avoids any suggestions regarding the primary difficulty 
involved in children’s undirected work, namely that of providing self- 
checking devices for material of intrinsic value to the child. 

Because of the directness and simplicity with which the psychologi- 
cal principles are stated, and because of the wealth of illustrative 
material used, the book will give immediate and practical help to many 
teachers and will no doubt be used widely and successfully in normal 
schools as a starting point for discussions of this most complex problem, 
“How to Teach Reading.” 

RUTH I. BROWN,
ELIZA OGLESBY.
Teachers College, New York City. 